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Foreword 


In 1962 a committee of the Division of Mathematics of the 
National Research Council presented to the Space Science Board of 
the National Academy of Sciences a report on the current and 
anticipated uses of mathematics in space activities. A key finding 
was the need for much more intensive work in various mathematical 
areas, both to develop more powerful results in these areas and to 
discover ways of using the existing knowledge more effectively. The 
Summer Seminar of 1963 was planned to fill this need in part. It was 
the third Summer Seminar in Applied Mathematics sponsored by 
the American Mathematical Society. 

In order to keep the Seminar within bounds and to achieve 
reasonable coherence, many mathematical areas of great importance 
and urgency for space activity were not considered at the Seminar. 
The most notable omissions are the area of nonlinear differential 
equations, which is of use in the study of guidance systems for 
propulsive space vehicles, and various areas of statistics, such as 
those involved in the design of experiments to be performed in space, 
analysis of large amounts of data from experiments aboard space 
vehicles, etc. 

Primarily, the topics considered at the Seminar were those having 
to do with the behavior of nonpropulsive space vehicles. Inevit- 
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ably, this led to heavy emphasis on the theory of orbits. For this 
reason, the Seminar was combined with the fifth annual Summer 
Institute in Dynamical Astronomy. Four previous Summer Institutes 
in Dynamical Astronomy have been sponsored by Yale University, 
in the summers of 1959 through 1962. They were devoted primarily 
to topics in the theory of orbits, and were more restricted in scope 
than the present Seminar. 

The orbits of most present and planned space vehicles differ so 
markedly from the orbits of the classical celestial bodies that only 
parts of the classical theories of celestial mechanics are applicable. 
Much effort is being expended to determine which of the classical 
methods are applicable, to find suitable modifications of some of the 
classical methods to make them more widely applicable, and to find 
new methods to handle novel situations. 

Such matters were a main concern of the four previous Summer 
Institutes in Dynamical Astronomy, and received a considerable 
amount of attention at the present Seminar; it was to assure this 
that the Seminar was combined with the fifth Institute. Notes were 
compiled for the Institute of 1959, but their supply is exhausted. 
Notes for the Institute of 1960 were edited by William E. Felling 
and published in 1961 by The McDonnell Aircraft Corporation, 
St. Louis, Missouri, under the title, “Notes of the Summer Institute 
in Dynamical Astronomy at Yale University, July 1960”. Copies 
of these notes can be obtained from The McDonnell Aircraft 
Corporation. 

The Institute of 1961 was held at Tucson, Arizona, and the 
Institute of 1962 was held at Yale University. Only scattered notes, 
mostly in mimeographed form, survive from these meetings. Some 
of these were collected and have been included in the present 
Proceedings. Specifically, from the Institute of 1961 come the 
chapters: Precession and nutation by Alan Fletcher; Lectures on 
regularization by Paul B. Richards. From the Institute of 1962 come 
the chapters: Problems of stellar dynamics by G. Contopoulos; Matrix 
methods by J.M.A. Danby; The effect of radiation pressure on the 
motion of an artificial satellite by Gen-ichiro Hori; The two variable 
expansion procedure for the approximate solution of certain nonlinear 
differential equations by J. Kevorkian; Stability and small oscillations 
about equilibrium and periodic motions by P. J. Message; The 
spheroidal method in satellite astronomy by John P. Vinti. 
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Some of the lectures of the earlier Institutes, and of the present 
Seminar, have been published elsewhere, either separately or as 
parts of larger works. Indeed, some lectures excerpted particularly 
relevant material already in print. Besides the Notes of the 1960 
Institute already cited, a list of publications containing material 
not covered in the present Proceedings which was presented in 
lectures at the Institutes and the Seminar follows: 

R. F. Arenstorf, Periodic solutions of the restricted three body prob- 
lem presenting analytic continuations of Keplerian motions, Amer. J. 
Math 85(1963), 27-35. 

V. I. Arnol'd, The stability of the equilibrium position of a Hamil- 
tonian system of ordinary differential equations in the general elliptic 
case, Dokl. Akad. Nauk SSSR 137(1961), 255-257 = Soviet Math. 
Dokl. 2(1961), 247. 

, Generation of quasi-periodic motion from a family of periodic 

motions, Dokl. Akad. Nauk SSSR 138(1961), 13-15 = Soviet Math. 
Dokl. 2(1961), 501. 

, The classical theory of perturbations and the problem of 

stability of planetary systems, Dokl. Akad. Nauk SSSR 145(1962), 
487-490 = Soviet Math. Dokl. 3(1962), 1008. 

, Proof of A. N. Kolmogorov's theorem on the preservation of 

quasi-periodic motions under small perturbations of the Hamiltonian, 
Uspehi Mat. Nauk SSSR 18, Ser. 5 (113), 1963, pp. 13-40. 

, Small divisor and stability problems in classical and celestial 

mechanics, Uspehi Mat. Nauk SSSR 18, Ser. 6(119), 1963, pp. 81-192. 

R. E. Bellman, Adaptive control processes; a guided tour, Princeton 
Univ. Press, Princeton, N. J., 1961. 

R. E. Bellman and S. Dreyfus, Applied dynamic programming, 
Princeton Univ. Press, Princeton, N. J., 1962. 

D. Brouwer, Solution of the problem of artificial satellite motion 
without drag, Astronom. J. 64(1959), 378-397. 

D. Brouwer and G. M. Clemence, Methods of celestial mechanics, 
Academic Press, New York, 1961. 

D. Brouwer and G. Hori, Theoretical evaluation of atmospheric 
drag effects in the motion of an artificial satellite, Astronom. J. 
66(1961), 193-225; 264-265. 

C. J. Cohen and E. C. Hubbard, A non-singular set of orbit elements, 
Astronom. J. 67(1962), 10-15. 

C. C. Conley, A disk mapping associated with the satellite problem, 
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Comm. Pure Appl. Math. 17(1964), 237-243. 

S. Dreyfus, Dynamic programming and the calculus of variations, 

J. Math. Anal. Appl. 1(1960), 228-239. 

W. J. Eckert and D. Brouwer, The use of rectangular coordinates in 
the differential correction of orbits, Astronom. J. 46(1937), 125-132. 

B. Garfinkel, Motion of a satellite in the vicinity of the critical in- 
clination, Astronom. J. 65(1960), 624-627. 

Y. Hagihara, Theories of equilibrium figures of a homogeneous rotat- 
ing fluid mass, Blaisdell, New York, forthcoming. 

, Celestial mechanics, Blaisdell, New York, forthcoming in 

four volumes. 

Paul Herget, The computation of orbits, Observatory of the Univer- 
sity of Cincinnati, 1948. 

, Computation of preliminary orbits, Astronom. J. 70(1965), 

i-2. 

G. Hori, Motion of an artificial satellite in the vicinity of the critical 
inclination, Astronom. J. 65(1960), 291-300. 

A. N. Kolmogorov, Dokl. Akad. Nauk SSSR 98(1954), 527-530. 

, General theory of dynamical systems and classical mechan- 
ics, Proc. Internat. Congr. Math., Amsterdam, 1954, (Amsterdam: 
Erven P. Nordhoff, 1957), Vol. 1, pp. 315-333. 

W. T. Kyner, Qualitative properties of orbits about an oblate planet, 
Comm. Pure Appl. Math. 17(1964), 227-236. 

t A mathematical theory of the orbits about an oblate planet, 

J. Soc. Indust. Appl. Math. 13(1965), 136-171. 

J. Moser, Nonexistence of integrals for canonical systems of differ- 
ential equations, Comm. Pure Appl. Math. 8(1955), 409-436. 

, On invariant curves of area-preserving mappings of an 

annulus, Nachr. Akad. Wiss. Gottingen Math.-Phys. Kl., No. 1 
(1962), 1-20. 

, Stability and nonlinear character of ordinary differential 

equations, Proc. Sympos. on Nonlinear problems, 1962, Univ. o t 
Wisconsin Press, Madison, Wis., 1963, pp. 139-150. 

P. Musen, Special perturbations of the vectorial elements, Astronom. 
J. 59(1954), 262-267. 

S. Ostrach, Melting ablation, to appear in 1966 as one of the Inter- 
national Series of Monographs on Interdisciplinary and Advanced 
Topics in Science and Engineering, Pergamon Press, Oxford. 

C. L. Siegel, Iteration of analytic functions , Ann. of Math. 43(1942), 
607-612. 
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, Uber die Existenz einer Normalform analytischer Hamilton - 

scher Differentialgleichungen in der Nahe einer Gleichgewichtlosung, 
Math. Ann. 128(1954), 144-170. 

Together with the above references, the present Proceedings 
should acquaint the reader with the current state of research on the 
behavior of nonpropulsive space vehicles, indicate the more pressing 
unsolved problems, and furnish examples of mathematical techniques 
which are currently useful. It is hoped that the reader will be stimu- 
lated to make contributions of his own, either in the way of developing 
better mathematical techniques, or finding more ingenious uses of 
existing ones. 

Besides presenting much new and advanced material, an effort is 
made in these Proceedings to give readers basic information, in 
fields other than their own, which they need to have a full under- 
standing of space problems in their own fields. Accordingly an effort 
is made to acquaint the mathematical specialists with the key space 
problems in aerodynamics, geophysics, orbit theory, etc.; to acquaint 
orbit specialists with useful mathematical techniques, and to give 
them enough background in geophysics, aerodynamics, etc. for them 
to see the relevance of these areas for the new orbits required for 
space exploration; and so on, for other areas represented in space 
activity. Thus, for the benefit of the readers who are not specialists 
in orbit theory, there is included the chapter Elliptic motion by 
J. M. A. Danby. This is a summary exposition of the simplest orbit 
problem, the problem of two bodies. However, it includes only those 
points of especial relevance for space activity, and makes no effort to 
cover all developments that have been made in the two-body problem 
in the more than three hundred years since Newton gave the first 
solution. 

Useful mathematical techniques for orbit studies are presented in 
the chapters Matrix methods by J. M. A. Danby, The Lagrange- 
Hamilton-Jacobi mechanics by Boris Garfinkel, Stability and small 
oscillations about equilibrium and periodic motions by P. J. Message, 
Lectures on regularization by Paul B. Richards, and The spheroidal 
method in satellite astronomy by John P. Vinti. For near earth satel- 
lites, precession and nutation play such a prominent role that the 
chapter Precession and nutation by Alan Fletcher is almost required 
reading for anyone concerned with near earth satellites. For satellites 
that go further afield and also come near the moon, the motion is 
essentially covered by the “restricted problem of three bodies” in 
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which the mass of one body is infinitesimal compared to the masses 
of the other two. This is dealt with in the chapter Notes on a two -degree - 
of -freedom irreversible dynamical system: the restricted problem of three 
bodies by Victor Szebehely. The chapters Prvblems of stellar dynamics 
by George Contopoulos and Qualitative methods in the n-body problem 
by Harry Pollard are explanations of some classic techniques of 
interest. 

To a first approximation, a small object will remain at one vertex 
of an equilateral triangle of which the earth and moon are the other 
two vertices; it rotates at the same rate as the earth and moon about 
a common center of gravity. This “triangular point” has been 
proposed as a good place to park a permanent space station, say for 
communication purposes. However, the pull of the sun would cause 
the space station to wander about the “triangular point,” perhaps to 
the extent of rendering it useless. Though this question is of 
immediate concern, no completely satisfactory treatment exists. Five 
chapters deal with aspects of this, namely Motion in the vicinity of the 
triangular libration centers by Andre Deprit, Motion of a particle in 
the vicinity of a triangular libration point in the earth-moon system 
by J. Pieter de Vries, The dominant features of the long-period libra- 
tions of the Trojan minor planets by P. J. Message, Outline of a theory 
of nonperiodic motions in the neighborhood of the long-period librations 
about the equilateral points of the restricted problem of three bodies by 
Eugene Rabe, and Elements of a theory of librational motions in the 
elliptical restricted problem by Eugene Rabe. 

Many artificial satellites have orbits very close to the earth, and as 
a result these orbits are much more perturbed by irregularities in the 
potential field of the earth than is the case for the more distant 
natural celestial objects. This is compelling a more careful study of 
the distribution of masses within the earth, as well as requiring new 
methods for taking account of the effects on orbits of irregularities in 
this distribution. Thus it is that two chapters are devoted to ques- 
tions of geophysics, and relevant matters, namely, The shape of the 
earth in the light of recent discoveries in space science by John A. 
O’Keefe and The stability of a rotating liquid mass by John A. O’Keefe. 

If the orbit of some satellite is known with sufficient precision over 
a considerable period of time, one can make many inferences about 
the potential field of the earth. The determination of such an orbit is 
based on observations from various stations on the earth. To deter- 
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mine the orbit with precision from these observations requires know- 
ledge of the positions of the stations relative to the equipotential 
surfaces of the earth’s potential field, and hence some knowledge of 
the potential field. The question of finding what orbits and potential 
fields are consistent with a set of observations is very difficult at best. 
Since the observations cannot be totally free from error, it may be 
that no orbit or potential field is exactly consistent with them, and 
one must use variational and optimization techniques to find an orbit 
and potential field that gives a “best fit.” A specific discussion of this 
point is given in the chapter Geodetic problems and satellite orbits 
by W. H. Guier. Relevant general mathematical techniques are 
discussed in the chapter Elements of calculus of variations and opti- 
mum control theory by Magnus R. Hestenes. 

Many artificial satellites have low orbits, where the drag of the 
atmosphere is not negligible. To understand better the nature of this 
drag requires investigation into some difficult areas of aerodynamics. 
Presentations were made at the Seminar of the latest results in this 
area, and are recorded in the chapters Basic fluid dynamics by S. F. 
Shen and Shock waves in rarefied gases by S. F. Shen. In extreme 
cases, such as re-entry or flow inside rocket nozzles, there may be 
chemical dissociation and other effects. These are discussed in the 
chapter Models of gas flows with chemical and radiative effects by 
F. K. Moore. The specific effects of drag on a satellite orbit are dis- 
cussed in the chapter Decay of orbits by P. J. Message. Other 
peculiarities of the orbits of artificial satellites are discussed in the 
chapter The effect of radiation pressure on the motion of an artificial 
satellite by Gen-ichiro Hori. 

In all too many problems of the most immediate urgency, no 
suitable theoretical treatments have yet been perfected, and the best 
that can be done is to seek approximate numerical answers by means 
of high speed computers. A discussion of computational techniques 
is given in the chapter Special computation procedures for differential 
equations by S. V. Parter. 

Although very little was said at the 1963 Seminar about general 
techniques for solving nonlinear differential equations, a chapter 
on the subject is included from an earlier Institute, namely The two 
variable expansion procedure for the approximate solution of certain 
nonlinear differential equations by J. Kevorkian. 

A topic which now particularly timely, namely the rendezvous of 


XVI 


FOREWORD 


two space vehicles, is treated in Rendezvous problems by J. C . Houbolt . 

The Seminar was supported by the following government agencies: 

National Science Foundation, 

Air Force Office of Scientific Research, 

National Aeronautics and Space Administration, 

Army Research Office (Durham), 

Office of Naval Research, 

Atomic Energy Commision. 

The success of such a Seminar is the result of much dedicated work 
and planning by many individuals. Gordon L. Walker, Executive 
Director of the American Mathematical Society, contributed greatly 
to the planning and functioning of the Seminar. Edmund T. Cranch, 
of the Engineering College at Cornell University, served as Adminis- 
trative Director. He handled matters of preparation skillfully and, 
with the assistance of Margaret Kellar from the Office of the 
American Mathematical Society, managed the day-by-day operation 
of the Seminar in a highly satisfactory fashion. The enthusiastic 
office staff of Ann Smith, Virginia Cranch, and Neva Strever were 
effective in the many things, both obvious and unseen, that they did 
to promote the success and unity of the Seminar. Space for offices 
and meetings, and many forms of cooperation, were furnished by 
Cornell University and members of its staff. The scientific planning 
and direction were done by the Joint Invitations and Organizing 
Committee. Its members were Dirk Brouwer, Director of Yale 
University Observatory; Morris Davis, Director of Yale Computer 
Center; William R. Sears, Director of the Graduate School of Aero- 
space Engineering of Cornell University; Victor Szebehely, Manager 
of Space Mechanics Operation, General Electric Company; and 
Chairman, J. Barkley Rosser, Director U. S. Army Mathematics 
Research Center, The University of Wisconsin. During the Seminar, 
Dirk Brouwer served as Associate Director and Morris Davis as 
Assistant Director. They assumed day-by-day responsibility for the 
staff and program, and also furnished valuable advice and continuity 
with previous Institutes. 
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Elliptic Motion 


1. Introduction to the mechanics of celestial bodies. 

1.1. Newton’s Law of Gravitation. Celestial mechanics is, in the 
main, a branch of Newtonian mechanics, and the fundamental law is 
Newton’s law of gravitation. It is true that this law may not cover 
every contingency in cosmogony, but its inadequacies in celestial 
mechanics are few indeed. Also, in cases where the relevant argu- 
ments of general relativity have achieved explicit forms, the result- 
ing modifications to motion governed by Newton’s laws have been 
dealt with by established perturbation theories of celestial mechanics 
(see [5]). 

Newton’s law states that: “any two particles attract each other 
with a force that is proportional to the product of their masses 
and inversely proportional to the square of the distance between 
them”. Let the particles have masses m 1 and m 2 , and position vec- 
tors T] and r 2 , respectively. Then the force exerted by m, on m, can 
be written 


- k 2 m 1 m 2 ( ri - r 2 ) / 1 r 2 - r 2 | 3 . 

(In these notes a vector is written in boldface type. A unit vector 
is written with a “cap” above it, i.e., r, and |r|, or simply “r” stand 
for the modulus of r. It is assumed that the reader is acquainted with 
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elementary vector algebra and calculus; if not, see [2].) k 2 is often 
written as “G”, and the value in c.g.s. units is about 6.67 X 10 -8 . 

Consider a system of masses m x ,m 2 ,---,m n at r 2 , • • • , r„. They 
will exert a total force on a mass m at r of amount 

- k 2 m Y. m i ( r - r.) / 1 r - r, | 3 . 

1 = 1 

The particle m will experience some force wherever it is, and the n 
bodies are said to set up a “field of force”. The strength of a field of 
force at a point r is the force exerted on a particle of unit mass placed 
at r. Strictly, the word “force” in this context means “force per 
unit mass”. The n bodies produce, therefore, a field of force 

n 

(1.1.1) - k 2 Y, «i(r- r f ) / 1 r — r ; | 3 . 

( = 1 

In the system of n masses the force acting on m, is 
- k 2 X m i m j( T ‘ ~ b) / lb ~ bl 3 - 

This can be calculated from the gradient of the “force function 

n 

(1.1.2) U - k 2 Y.'^ ) tn i m j /\ti- tj\. 

j= 1 i<j 

For instance, the x-component would be dU/dxi. Since 

I b - bl = ( (*. - X]) 2 + (y.- - y;) 2 + (b - z j) 2 } il2 

then 

<9 1 r t — r y | fdXi = (x, — x y ){ (x t — Xj) 2 + (y* — y>) 2 + ( Zi — zj ) 2 ) 1/2 
and 

dU/dxi = - k 2 X m, myfjc, - Xj)/\ti- r,| 3 . 

The force function is the negative of the work that would be done 
in assembling the system of n bodies from a state of infinite diffusion. 
As the words are normally used, it is minus the potential; but this 
convention is not universal, and I shall use only force functions here. 

The transition from particles to solid bodies is accomplished by 
integration. Consider the force function of a uniform, thin spherical 
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Figure 1. Force Due to a Shell 

shell at a point 0 outside the shell (see Figure 1). Let the shell have 
center C, radius a, thickness da , and density p; and let OC = r. If P 
is a point on the shell, let the angle OCP — 8 . Divide the shell into 
thin rings perpendicular to OC and defined by 0 lying within the 
limits 8 and 8 + d8. The radius of a ring is a sin 8 , and its mass is 

p2*a sin0ac?0da. 

Any element of the ring is at the distance 

(r 2 + a 2 - 2ar cos 0) 1/2 

from 0, so that the force function of the ring at 0 is 
k 2 p2ica 2 da sin 9d8(r 2 + a 2 — 2 ar cos 0)“ 1/2 , 
and the total force function due to the shell is 

U = k 2 p2na 2 da sin0d0(r 2 + a 2 — 2ar cos0) ~ 1/2 , 

where the square root must always be positive. This can be inte- 
grated at once to give 

U = ~ k 2 dm 1" — (r 2 + a 2 — 2 ar cos0) 1/2 l 
2 Lra J*=o 

= k 2 dm/r y 

where dm — 47ra 2 pda is the mass of the shell. This means that, so 
far as 0 is concerned, the shell could just as well have all its mass 
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concentrated at C. This must also apply to a shell of finite thickness, 
since the result is not affected by integration over a, and it applies 
in particular to any solid body that is constructed in concentric 
spherical shells, provided we are outside it. 

If, therefore, we have a system of n bodies, each having spherical 
symmetry, then they can be considered as particles generating a 
force function (1.1.2), provided they do not approach too close to 
each other. The mass of each body is considered to be concentrated 
at its center of gravity, and the coordinates of a body are the coordi- 
nates of its center of gravity. 

Fortunately, in most problems of celestial mechanics the bodies 
can be assumed to be spheres. In the first place they are, in fact, 
nearly spherical, and in the second place the distances between the 
bodies are usually large compared with the dimensions of the bodies 
themselves. In the case of the motion of an artificial satellite the 
latter condition does not hold, and the oblateness of the Earth 
actually causes major perturbations in the motion. 

Outside a gravitating body the force function must satisfy La- 
place’s equation, 

V 2 U=d 2 U/dx 2 + d 2 U/dy 2 + d 2 U/dz 2 = 0. 

(This can be proved by differentiating equation (1.1.2); the summa- 
tion is replaced by an integration.) It transpires that the force 
function of the body can normally be expanded in a power series 
in 1/r, where r is the distance from its center of mass; the coeffici- 
ents are called sperical harmonics. If, as is often the case, the body 
has symmetry about an axis, the force function can be expressed as 

d.1.3) -^J 2 P 2 -pJ 3 P 3 - •••), 

where the P t are Legendre polynomials (functions of the latitude) 
and the J t are constants; if the body is nearly spherical, the latter 
become small quite rapidly. Now it would be possible to find the 
force function of such a body by integration, if we knew precisely 
how it was put together. Failing this knowledge, it is still possible 
to write down its force function directly, so far as all the variable 
quantities are concerned. The theory of the motion of an artificial 
satellite, without drag, can be constructed using the force function 
(1.1.3). Then, later, observations may furnish the values of the Ji. 


ELLIPTIC MOTION 


5 


A lack of knowledge about the insides of a body is therefore no great 
hardship when its force function is required. (For more details, see 
[2, Chapter 4].) 

1.2. Newton's Laws of Motion. We are concerned with Newtonian 
mechanics, the basic assumptions of which are contained in New- 
ton’s laws of motion. These are: 

(1) Every particle continues in a state of rest or uniform motion 
in a straight line unless it is compelled by some external force to 
change that state. 

(2) The rate of change of the linear momentum of a particle is 
proportional to the force applied to the particle and takes place 
in the same direction as that force. 

(3) The mutual actions of any two bodies are always equal and 
oppositely directed. 

A man who observes the motion of surrounding bodies that are 
not acted on by forces, and notes that they are not accelerated is 
entitled to feel that, for practical purposes, he is at rest with respect 
to some inertial system of reference. But if these bodies have any 
accelerations, then he is not (although he may invent forces such as 
centrifugal or Coriolis forces, to preserve the illusion). Certainly, 
no point fixed on the surface of the Earth could be the origin of an 
inertial system, although some sufficiently parochial experiments 
might give that impression. Motion observed in an ideal case by a 
nonrotating man at the center of the Earth would still show acceler- 
ation because of the action of the Sun, Moon, etc., on the Earth. 
Similarly, motion observed from the center of mass of the solar 
system should be affected by nearby stars, and the field of the 
galaxy (to say nothing about nearby galaxies): this is true in prin- 
ciple; but there is no known experiment to detect such effects, so 
that no purpose is served by considering acceleration with respect to 
the center of the galazy, and so on. So we shall not worry about 
the practical difficulties of choosing an inertial reference system, and 
we are certainly not concerned here with the thornier difficulties as 
to whether such a system can exist at all. We adopt the attitude 
that, given any problem in Newtonian mechanics, there exists an 
inertial system with respect to which the equations of motion can 
be written down; but no special assumption must be made about 
the whereabouts of the origin. Once the equations of motion have 
been set up, algebra will enable the origin to be transferred to this 
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place or that. Also, inspection of some terms in the equations may 
result in their being rejected on account of their smallness. But the 
original equations must be written down without any assumptions 
being made about the origin, or the relative importance of different 
terms. 

The measurement of '‘uniform motion” requires the use of a 
“uniformly flowing” time. The use of Universal Time (which is based 
on the rotation of the Earth) threw up accelerations of the Moon and 
planets that could not be explained by Newtonian mechanics, but 
which could result from nonuniform flowing of Universal Time. 
A suitable time has therefore been invented; this is Ephemeris Time. 
Its relation with Universal Time is given in the almanacs. 

The second law can be applied only to motion observed with 
respect to an inertial reference system. If a particle of mass m is 
at r and the resultant of the forces acting on the particle is F, then 


( 1 . 2 . 1 ) 



Two important formulas follow from this. Firstly, 


( 1 . 2 . 2 ) 


rX 


F -s( ,xm l)- 


or “the moment of the external forces is equal to the rate of change 
of the angular momentum”. Then, if F is the gradient of a force 
function U that does not contain the time explicitly, and if m is 
constant, 


(1.2.3) 



U = constant. 


(For, differentiating (1.2.3) with respect to the time, we have the 
scalar product of dr/dt and (1.2.1).) This is the energy integral. 

The third law is obeyed by Newton’s law of gravitation, and is 
needed in a derivation of this law from Kepler’s laws of planetary 
motion (quoted in §2.4). 

Newton’s laws apply directly to the motion of particles. If a 
body of finite extent is acted on by a system of forces, then the 
motion of its center of mass can be found by shifting the forces 
parallel to themselves so that their lines of action pass through the 
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center of mass. The motion about the center of mass is considered, 
basically, through equation (1.2.2); subjects such as precession or 
physical libration fall under this heading; but they will not be con- 
sidered here. 

1.3. Equations of Motion . Consider the motion of n particles with 
fixed masses m u which have position vectors p u p 2 , 

p n , with respect to an inertial reference system. The equation of mo- 
tion of m, is 

(1.3.1) niiPi = — k 2 mi Y rnj Pl ? Pj , 

Pij 

where p,;,- = |p t — pj\. A prime stands for differentiation with respect 
to the time. Adding the equations for all the particles, the forces 
cancel (from the algebra, or from Newton’s third law) leaving 

(1.3.2) i> lP f =0. 

i=i 

But£ m ;P; is proportional to the position vector of the center of 
mass of the system, and (1.3.2) says that this not accelerated with 
respect to the original inertial system; therefore the center of mass 
could be the origin of an inertial reference system. 

Multiply (1.3.1) vectorially by r, X , and add all n equations. The 
right-hand sides again cancel, leaving 

n n 

(1.3.3) m,r, X rf = 0, or £ m, r, X r[ = h 

1=1 1=1 

where h is a constant vector. The plane through the center of mass 
of the system and perpendicular to h is constant throughout the 
motion, and is called the “invariable plane” of the system. 

The equations (1.3.1) can be written in the form 

(1.3.4) m.p'f = ViU, 

where, if Pi has components (&, r^, fr)> V, has components d/d£,-, 
d/d??,-, d/df„ and where 

(1.3.5) • 

i<j~l Pij 

We therefore have the energy integral for the whole system, 
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1 * 

(1.3.6) x rriip'i 2 - U = constant. 

^ 1=1 


But no integral can be written down for an individual member of the 
system. 

Suppose that one mass, m n , is considered to be dominant, either 
because of its relatively great magnitude, or because the motion in 
which we are interested takes place very close to it. Subtracting the 
equation of motion of m n from that of mi (after dividing by m n and 
m„ respectively) we find 


Pi 


- pS = - k 2 z 


m, 


Pj 




3 

Pij 


+ k 2 Z 


m, 


j = i 


Pn — Pj 
3 

Pnj 


— — k 2 m n 





P: 



; = L Py 


Pn; J 


Now let the position vector of m l with respect to m n be r^, so that 
r/ pi p n . Then 


(1.3.7) 


+ k 2 (m n + m t ) -4 

n 


«* 2 z 


m r T > 

rrtj — 



Further, if 


(1.3.8) 



then 

r n-\ 

(1.3.9) rf + fe 2 (m„ + mj) 4 = 2Z m j v * 

r i >=iw>» 


Now if all the masses except m n and mj, were zero, the right-hand 
sides of (1.3.7) or (1.3.9) would vanish, and the equations of motion 
would refer to the two-body problem; the solution of this is called 
Keplerian motion, and is described in the following chapter. It is 
frequently possible in celestial mechanics to find a dominant body, 
m n , such that the terms on the right-hand side of (1.3.9) are much 
smaller than k 2 (m n + In this case the motion can be con- 
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sidered as Keplerian motion “perturbed” by the forces on the right- 
hand side. This is why Keplerian motion is so important in celestial 
mechanics. The word “perturbation” normally implies a departure 
from Keplerian motion; the forces on the right-hand side of (1.3.9) 
are “perturbing forces” and the R l} are “perturbative functions”. 

The reference system in (1.3.9) is noninertial. The terms on the 
right-hand side include the “direct” attractions of the bodies on 
and the “indirect” attractions on m n , the origin. In a practical 
application many of these terms might be found to be negligible; 
but it can happen that the direct attraction of a body is negligible, 
but the indirect attraction is not. Further modifications can be 
made by adjusting the origin, and the mass of the dominant body; 
for details, see [2, §9.5]. 

2. The two- body problem. 

2.1. Properties of Conics. Any orbit in the two-body problem is 
a conic, and before discussing the solution we shall briefly review 
the relevant properties of conics. 

The polar equation of a conic can be written as 

(2.1.1) p/r = 1 + e cos/, 

where the origin is at a focus of the conic, and / is the polar angle, 
measured from the major axis. [Equation (2.1.1) follows from the 
“focus- directrix” definition of a conic; i.e., that it is the locus of a 
point such that the ratio of its distance from a fixed point (a focus) 
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Figure 2. Features of an Ellipse 
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to its distance from a fixed line (a directrix) is constant, the value 
of the constant being equal to e.] e is the eccentricity ; if e = 0 the 
conic is a circle; if e is less than one, it is an ellipse, which is bounded; 
if e = 1 it is a parabola; if e is greater than one it is a hyperbola. 

Let an ellipse have center C, foci S, S' , major axis AA' , and minor 
axis BB', as shown in Figure 2. The following relations are useful, 
and should be memorized: 

CA = CA' = a, 

CB = CB' = b, 

SA = q = a(l — e ), 

SA' = q' = a( 1 + e), 

CS = CS' = ae, 

P = a{ 1 - e 2 ), 

b 2 = a 2 (l - e 2 ), 

SB = a. 

The ellipse can be obtained by the vertical projection of a circle. 
Let Q be a point on the circumference of the circle, and P the corre- 
sponding point on the ellipse, and let QP cut the major axis at R. 
Then PR/QR = b/a. Further, let z ACQ = E (the “eccentric anom- 
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aly”). With origin at S', let the X-axis point along SA t and the Y-axis 
point along the latus rectum, as shown in Figure 3. This reference 
system will be called the “orbital reference system”. Then the co- 
ordinates of P can be written: 

(2.1.2) X = a(cosi£ — e) — r cos/, Y = b sin 2? = rsin/. 

The area of the ellipse is nab. We also have 

(2.1.3) r = V(X 2 + Y 2 ) = a(l — e cosE). 

Formulas for the parabola can be obtained from those for the 
ellipse by (carefully) letting a— > °o and e->l. It is safest first to 
eliminate a or e using q = a(l - e), since q remains finite. Suitable 
modifications to cover hyperbolic motion will be given in §2.4. 

2.2. The Solution of the Orbit. Consider two particles of mass m x 
and m 2 . Let the position vector of m 2 with respect to m x be r. From 
(1.3.7) we see that the equation of motion of m 2 is 

(2.2.1) x" + k 2 {mi + m 2 )x/r 3 = 0. 

If the origin were at the center of mass of the two bodies, the 
reference system (nonrotating) would be inertial. Then if the masses 
were at ri and r 2 , the equation of motion of m 2 would be 

m 2 x 2 = — k 2 m 1 m 2 r/r 3 
or, since r 2 = [ m x / (m x + m 2 ) ] r, 

(2.2.2) r 2 = — k 2 [mil{mi + /n 2 ) 2 ]r 2 /r 2 . 

Equations (2.2.1) and (2.2.2) are of the form 

(2.2.3) r" = - fit/r 3 
but with different values of n. 

Equation (2.2.3) requires six constants of integration for its solu- 
tion. Taking r X (2.2.3), we find r X r" = 0, so that 

(2.2.4) r X r' = h, a constant. 

h supplies three arbitrary constants. From (2.2.4), r • h = 0, which 
is the equation of a plane through the origin. The motion must take 
place in this plane; h determines its orientation, as well as the magni- 
tude of the angular momentum. Now take h X (2.2.3), and use 
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h X r" = — 4(rXr') Xr 
r 

= - p W t' - (r-r')r] 

= -pk 2 r'-{rr')r] 

= - M [r '/r- rr'/r 2 ] 

= -4 ( r/r, 

= - 

[r / is the velocity vector; r ' is the rate of change of the scalar r. 
Differentiating r 2 = r 2 , we find r • r ' — rr'\ a useful relation.] Inte- 
grating, we obtain 

(2.2.4) h X r' = - - P, 

where P is an arbitrary vector; but since it is perpendicular to h, it 
only contains two arbitrary constants. The remaining constant of 
the motion will be considered in the following section. Taking 
r • (2.2.4) we obtain 

r • (h X r') = - nr — P • r 
or 

- h - (rx r') = - Mf - P-r 
or 

h 2 = nr + P • r 


or 

^ = 1 + (P/m) • 

r 

This is the same as equation (2.1.1). We have /i 2 /m = P, the vector 
P points along the major axis toward pericentron, and P = pe. The 
angle f is called the “true anomaly”. If e is greater than one, only 
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the branch of the hyperbola that is concave toward the origin can 
be described in the motion. 

2.3. The Orbit in Time. The vector r has components r' along t 
and rf' perpendicular to it; therefore the modulus of r X r' is r 2 f', 
which is twice the rate of change of the area swept out by the radius 
vector. From (2.2.4) we have 

(2.3.1) r 2 (df/dt) = h. 

The integration of this equation supplies the final constant of inte- 
gration. Substituting for r from (2.1.1) we get a simple integral; 
but except when e = 1, it is convenient to introduce an intermediate 
angle, the eccentric anomaly. 

Assume the motion to be elliptic. Differentiating (2.1.3) we find 

r' = ae sin EE'. 

And differentiating r cos / = a(cosE — e ) (from (2.1.2)), 
r' cos / — r sin //' = — a sin EE'. 

Eliminating r' and /' from these two equations and (2.3.1) we find 
h sin/ = a sin£E'(l + e cos f)r. 

Now using the relation 

h 2 = mP = m«(1 — e 2 ), 

and the formulas (2.1.2) and (2.1.3) to eliminate f and r, we find 
Vu i/a 3 ) = (1 - ecos E)E', 
which can be integrated to give 

V (m/o 3 ) (t — T) — E — e sin E, 

where T is a constant of integration; it is equal to the time when 
E = 0, or when the body is at pericentron. This is Kepler’s equation. 

| By letting the eccentric anomaly go from 0 to 2 tt, we get the time 
| for a complete revolution, or the period of the motion, which is 

(2.3.2) p = 2tt \/ (a 3 / ju) . 

The “mean motion”, n, is defined by 

(2.3.3) 


n = 2ir / P, 
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so that n 2 a 3 = The angle 

(2.3.4) M = n(t — T) 

is defined as the “mean anamaly”. So Kepler’s equation can be 
written as 

(2.3.5) M = E - e sinE. 

Normally we are given the time, and want to calculate E . That 
there is a unique solution can be seen from the fact that the right- 
hand side of (2.3.5) is monotonic increasing with E (for its differen- 
tial coefficient with respect to E is (1 — e cos E), which is always 
positive). One of the best ways to find E is to use Newton’s Method. 
If E 0 is a good guess, and E is correct, let 

A E = E - E 0 , 


and 

A M = M — M 0 = M — Eq + e sin E 0 . 

Then if (A E) 2 is neglected, 

A E = AM/ (1 — e cos E 0 ) . 

Because of the approximation, this correction is not exact, and the 
process will have to be repeated until A M becomes less than some 
small pre-assigned value. This process converges best when e is 
small, when a good first guess is 

E 0 = M + e sin M, 

(although the series for E in terms of M and powers of e, given in 
the following chapter, can be truncated further along if desired). For 
more details, and for a discussion of the situation when e is nearly 
equal to one, see [3]. 

2.4. Miscellaneous Properties . Kepler’s three laws of planetary 
motion are: 

(1) The orbit of each planet is an ellipse, with the Sun at one of its 
foci. (Actually “Keplerian motion” is often now taken to include 
parabolic and hyperbolic motion, so that “conic” might replace 
“ellipse”.) 

(2) Each planet revolves so that the line joining it to the Sun 
sweeps out equal areas in equal intervals of time. (Therefore the 
acceleration of the planet is directed toward the Sun, and so also is 
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the force acting on the planet. From this law, and the first, Newton’s 
law of gravitation can be deduced.) 

(3) The squares of the periods of any two planets are in the same 
proportion as the cubes of their mean distances from the Sun. 
(This law should be modified so that P 2 (m l + m 2 ) /a 3 is a constant 
for any two bodies, where a is the semimajor axis of the relative 
orbit, P is the period and and m 2 are the masses of the bodies. 
The law can be used to find the mass of a planet that has a satellite.) 

Many important formulas for elliptic motion have been given 
already. A notable omission is the energy integral, 

(2.4.1) r ' 2 = M (2/r- 1/a). 

The parabolic velocity, or velocity of escape is found by putting 
1/a = 0. The circular velocity is found by putting r = a. 

When changing from E to f or / to E, the following formulas are 


useful: 





cos / = 

(cos E ■ 

- e)/(l — ecos E), 

(2.4.2) 

sin/ = 

V(1 - 

e 2 )sinf?/(l — e cos E), 


cos E = 

(e + cos/)/(l + e cos f). 


sin E = 

\/(l~ 

e 2 )sin//(l + ecosf). 


Using the relation tan 2 (//2) = (1 - cos/)/(l + cos/), it is easy to 
verify that 

(2.4.3) tan (//2) = tan(E/2). 

When using these formulas, it should be remembered that // 2 and 
E/2 always lie in the same quadrant. If we write 

e = sin </> (0 ^ <t> < *•/ 2), 

as is commonly done, then 

tan(//2) = tan(ir/4 + 0/2)tan(£/2). 

From Kepler’s equation, and (2.1.3) we have 

(2.4.4) 

Also we have 


E' = na/r. 
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(2.4.5) r' = na 2 e sin E/r = ( en/h)smf . 

Formulas (2.1.2) are important. Differentiating them, we find 

(2.4.6) X' = - na 2 smE/r, Y' = na 2 \/(l - « 2 ) «*£/r. 

In parabolic motion let q be the pericentron distance, then the 
equation of the orbit is 

r = q sec 2 (//2). 

An equation involving the time is 

i tan 3 (//2) + tan(//2) = %/ (m/2 q 3 )(t - T). 

tJ 

Formulas for hyperbolic motion can be derived from those for 
elliptic motion as follows. Assume a to be negative for a hyperbola. 
If / 2 = — 1, replace E by iF (so that cos E becomes coshF and 
sin£ becomes i sinhF), replace n by — iv , where u z a s = — n, and 
v is positive, and replace \/(l — e 2 ) by i\/ \e l - 1). 

Many important formulas have been omitted here. The reader 
should consult, in particular, [3]. 

2.5. The Orbit in Space. An orbit is defined by six constants, and 
these require some kind of reference system. The celestial equator 
or ecliptic are often used as reference planes, with the direction of 
the vernal equinox defining an axis. Neither of these planes is fixed, 
and it is necessary to use their mean positions for some definite 
epoch. 

A suitable set of constants would be the components of position 
and velocity, r 0 , To, at some time t 0 ; it is possible to calculate from 
these the position r at any time t (formulas for the calculation of the 
velocity are easily deduced and will not be given here). Since the 
motion takes place in a plane, it must be possible to resolve r along 
the directions of r 0 and r ( ). So we can write 

(2.5.1) r = ff 0 + gTo, 

where f and g are scalar functions of t Q and t and the initial condi- 
tions. From (2.5.1) we find 

fh = r X To and gh = r 0 X r. 

These are vector equations, independent of the reference system. 
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So f and g can be evaluated by substituting components referred to 
the “orbital reference system” defined in §2.1. The components are 
given by formulas (2.1.2) and (2.4.6). After substitution and some 
simplification, we find 

f ~ - [ cos {E — E 0 ) — e cos E 0 ] , 

(2.5.2) r 

g = - [ sin (E — E 0 ) — e(sin E — sin E 0 ) ]. 
n 

Before using (2.5.1) and (2.5.2) to calculate r, it would be neces> 
sary to calculate a, e, E 0i and E. We are given r 0 and Tq. (2.4.1) 
will give a. Then (2.1.3) and (2.4.5) will give e cos E 0 and e si nE 0 , 
from which e and E 0 can be found. (In using (2.4.5), remember that 
= i*o • To-) Finally, Kepler’s equation can be used to find E. 

In the formulas above a, e, and E 0 are introduced as intermediate 
elements; but they help to give a picture of the shape and size of 
the orbit, and the initial whereabouts in the orbit, that r 0 and ro com- 
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Figure 4. Geometrical Elements. 
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pletely fail to do. It is more usual to use six constants, each of which 
has an easily visualized geometrical meaning; these are the “geome- 
trical elements” of the orbit, a and e are two possible elements, 
and a third is a time of pericentron passage, T, or any number, such 
as the mean anomaly at some time, that enables the position in the 
ellipse to be found at any time. The description of the orientation 
of the orbit in space requires three angles, illustrated in Figure 4. 
In the figure the fundamental plane is the ecliptic (it could equally 
well be the celestial equator, if preferred) , the Sun is at 0, Ox points 
toward the vernal equinox and Oz toward the north pole of the 
ecliptic. The plane of the orbit cuts the celestial sphere in the great 
circle NPR where N is the point where the body in its orbit crosses 
the ecliptic, going north; it is called the “ascending node”. The angle 
xON (measured eastward around the ecliptic) is called the “longi- 
tude of the ascending node” and is written as ft. The angle between 
the ecliptic and the plane of the orbit is the “inclination”, /. For 
0 ^ / < 90° the orbit is direct; for 90° < / < 180°, it is retrograde. 
If OP points toward pericentron, the angle NOP — o> (measured in 
the sense in which the orbit is described) is called the “argument of 
pericentron”. 

These six constants are sufficient to give a geometrical picture of 
the orbit, and to enable position (and velocity) in the orbit to be 
calculated at any time. Among the alternatives often used is xs — ft 
+ co, called the “longitude of pericentron”. (The word “pericentron” 
would be replaced by “perihelion”, or “perigee”, etc. as appropriate.) 

To find the position at any time when the elements are given, 
first solve Kepler’s equation for the appropriate value of the ec- 
centric anomaly, and then use equations (2.1.2) to find the coordi- 
nates in the orbital reference system. The coordinates in this system 
can be related to the coordinates in any other system by a series of 
rotations. The following successive rotations: — o> about the Z-axis, 
— I about the new x-axis, and — ft about the new z-axis, will trans- 
form coordinates in the orbital reference system to those in the 
jc-, y-, z-system of the figure. A further rotation about the x-axis 
through — e (where e is the obliquity of the ecliptic) will lead to 
coordinates based on the celestial equator; these are necessary if 
right ascension and declination are to be calculated. The transforma- 
tion resulting from a rotation about an axis of reference can be most 
conveniently described by a matrix multiplication. For details, see 
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[2, Appendix B], The result of all the rotations described above can 
be written in the form 


(2.5.3) 


' x~ 


p> Qx 

y 

= 

Py Qy 

z 


1 

1 



where the P's and Q’s are direction cosines of the X- and Y-axes with 
respect to the jc-, y-, z-axes. 

Suppose that it is required to find the geometrical elements when 
position and velocity, r 0 , r' 0 , are given for a time t 0 . a is found from 
(2.4.1) and e and E 0 from (2.1.3) and (2.4.5), as before. Then T, or 
M 0 , the mean anomaly at the epoch, can be found from Kepler’s 
equation. The individual angles 12, w, and I might now be found from 
the formulas: 


h = r 0 X Tq 


— (^x* ky> ^z) y 

h x = h sin 12 sin /, 


(2.5.4) 


h y = — h cos 12 sin/, 
h z — h cos /. 


/from (2.4.3), 


sin u = 


z 

r 


cosec /, 


r cos u = x cos 12 + y sin 12, 


(u is the “argument of the latitude”) 

w = u — /, 

where an extra 360° may have to be added to make to lie between 0 
and 360°. 

Alternatively, it may be better to find the P’s and Q’s of (2.5.3) 
directly. (2.5.3) can be written more generally as 


"Xo 

Xo' 


’Px 

Qx' 

yo 

y o 

= 

Py 

Qy 

_Zo 

z 0 . 


Px 

Qx. 


’X 0 Xf 
Y 0 Yq 


where X 0 = a(cosP 0 — e)» etc. from (2.1.2) and (2.4.6). Then, solving 
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for the P’s and Q’s, we find (since X 0 Y£ - Y 0 Xq — h), 


(2.5.5) 


m Px & 
Py Qy 
P z Q z 


X 0 

*6 


^0 

z 0 

Zo 


Yi/h -Xt/h 
— Yq! h Xq/ h 


The individual angles ft, w, and / can also be determined from the 
P’s and Q’s. 

For an account of the determination of the elements when two 
positions for two different times are given, see [3]. 

In certain cases some element can only be poorly determined. 
For instance, if e is small, E 0 and wor® cannot be found as accurate- 
ly as the other elements because, somewhere along the line, their 
calculation involves division by e. Similarly, if I is small, ft is poorly 
determined. It is possible to put too much emphasis on the difficul- 
ties that result, w or ft should not be considered as goals in them- 
selves. Suppose that the object of the work is to calculate position 
and velocity at any time; then it need not matter that for small e 
an angle such as u> is poorly determined (in fact there will be a multi- 
plication by e during the calculation) , and the accuracy of the final 
result need not suffer at all. Difficulties due to a small I can be 
avoided by using the P’s and Q’s. If a programmer is determined 
to avoid any division by e , there are several ways in which this can 
be achieved. One possibility is to use ecosE 0 and esinPo as ele- 
ments; there need be no doubt about their accuracy. Let E be the 
eccentric anomaly at time t, then from Kepler’s equation applied to 
the times t 0 and t , we find 


(2.5.6) 


n(t — J 0 ) — E — E 0 — € sin E + e sin E 0 

= A E — e cos E 0 sin A E — e sin E 0 cos A E + e sin E 0y 


where A E = E — E 0 . This can be solved for A E, and then e sin E 
and e cos E can be calculated, and (2.5.2) and (2.5.1) used to find 
the position at time t . 

If the elements are to be considered as slowly varying quantities 
in perturbed motion, other problems may arise, and different ele- 
ments are needed for special cases. 


3. Expansions in series. 

3.1. Expansions in Powers of the Eccentricity . The stumbling block 
in any attempt to express position in Keplerian motion explicitly 
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in terms of the time comes in any attempt to express the eccen- 
tric anomaly explicitly as a function of the mean anomaly. In general 
it cannot be done in a finite number of terms. But if the eccentricity 
is sufficiently small, approximate expressions can be developed that 
are good enough. Fortunately, nearly all the planets and satellites 
in the solar system have orbits with moderately small eccentricities. 

For a circular orbit, E = M. If e is small, then, writing Kepler’s 
equation in the form 

E — M — e sin E, 

we see that to the order of e, we can put 

Ei = M + e sin M. 

Now if we put E 2 = Ei + 5E U and ignore e 3 , we find 
E 2 — M + e sin M + ~ e 2 sin 2 Af. 

Further development along these lines becomes immensely tedi- 
ous, and it would be an advantage if some formula could be found 
that would give the general term. Such a formula is given by La- 
grange’s theorem, which can be stated for the problem in hand as 
follows: Let 

E = M + ef(E), 

then 

F(E) = F(M) + ^f{M)F'{M) + A- { [/(M) ] 2 F'(M) } + - - - 


+ 


e q d”- 1 
q\ dM 


\[f(M)] q F'(M)} + .... 


Now put F{E) 5E E , so that F'{E) — dF/dE = 1; and put f(E) 
= sin E . Then we get 


" 2 d 


E = M + ^ sin M+~ (sin 2 M) + 

1! 2! dM 


(3.1.1) 


n! dM 1 


(sin " +1 M) + 


+ * 


Any other F{E) y such as r F(E) — a(l — e cos E) y can be expanded 
similarly. 
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The general statement of Lagrange's theorem would be useful, 
because it includes the condition for convergence of the series (and 
it is not often that a question of the convergence of a series in celes- 
tial mechanics can be answered). Limitation of space prevents a dis- 
cussion here, but see [4, §46]. The upshot is that series in powers 
of the eccentricity converge for values of e less than 0.6627 
3.2. Applications of Lagrange's Theorem . An unattractive feature 
of (3.1.1) is that powers of trigonometric functions appear. It is 
usually simpler to deal with terms such as sin kM rather than sin M, 
so that Fourier series are generally preferable to power series. Also, 
it is laborious to change from one to the other, so that it is an 
advantage if a Fourier series can be generated in the first place. 
One way of doing this is to use the exponential function. For 

E lk M = cos kM + i sin kM , 

where E is the exponential and i 2 = — 1, so that what is generated 
as a power series in E lM becomes a Fourier series. 

Consider (2.4.3). It is usual in these developments to get rid of 
the square root, so we introduce 

l+_g = / A + e\ 

1-0 \ \ 1 — e/ 


0= ie(l + |8 2 ). 

Also, introducing sin<^ = e, we have = tan {</>. (3.2.2) could 
equally well have been written as 


(3.2.1) 

so that 

(3.2.2) 


0 = M + 


G-) 


(1 + 0 2 ), Af = 0. 


Then 0 ; can be expanded in powers of {e by Lagrange’s theorem 
to give 

^ -.?[(! 1(1 
- j s [ (I bft [ « ■ 
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For a term to survive the operation M = 0 after the differentiation, 
we must have 2p+j — l = q—l. Then for a definite value of p, 
q = 2p + j', so we can write 


(3.2.3) 


(2p + y — 1) ! 
(p + j)\p\ 


•'-'Ur) 

-«-y mhms* 


j'O' + 3) 
2 ! 



We are now in a position to consider expansions in powers of p. 
For the applications, put 


(3.2.4) if = log x, iE = logy, iM = logz, 

where i 2 = — 1, and the logs are to the base E, so that x = E if , etc. 
Then 


x k + 1/jc* = 2 cos kf, x k — l/x k = 2i sin kf, etc. 
From (2.4.3) and (3.2.1) we have 


*- 1 1+Py-l 
*+i i — (8 y + l ’ 


(3.2.5) 


y- 
l - fly’ 


or 


y = 


* + /3 
l + /8x’ 


Then from the first of these. 


log x = logy + log(l - Ply) - log(l - fly), 

and, bearing in mind that for |z| <1, log(l + z) = z — z 2 /2 
+ z 3 / 3 + • • • , we can write this as 

log* = logy + 0(y - 1/y) + | p 2 (y 2 - 1/y 2 ) H , 

so that, from (3.2.4) we have (after division by i) 

/9 3 sin3 E + •••^ . 

From (3.2.5) we see that to exchange x and y it is sufficient to 
change the sign of p. Therefore 


'■ypsi 


(3.2.6) f=E + 2[p sin E+ i/3 2 sin2£+ ± 


(3.2.7) 



i0 2 sin2/+|j8 3 sin3/H ^ . 
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Substituting from (3.2.4) into Kepler’s equation, we have 
log z = logy - i e(y - 1/y). 


Eliminating y, from (3.2.5), and using (3.2.2) to eliminate e, we can 
transform this to 

, . P ( l - ff 2 )!* 2 - 1) 

log z = logx + log(l + P/x) - log(l + Px) - ( x _J_ j8)(l + Jx) ' 

Now (1 — 0 2 )/(l + P 2 ) = \/(l — e 2 ) = cos </>, so that the final term 
on the right-hand side can be written as 

— fi COS (f>[x / (1 + fix) — (1/jc)/(1 + pfx) ] 

= |Scos<^i(l - p/x + p 2 /x 2 - •••) -x(l - px + p 2 x 2 - --Oj. 

Therefore, expanding the logarithms as before, and substituting from 
(3.2.4), we get 

M = / — 2 (^Psinf — | P 2 sin 2/ + ••• J 


(3.2.8) 


+ 2p cos<fi ( — sin/ + P sin 2/ - • • •) 

= f — 2^/3(l + cos</>) sin/ — P 2 ^ + cos</>^ sin 2/ 

+ p 3 ^ i + cos <t>) sin 3/ + • • • J . 


The difference between the mean and true anomalies is called the 
“equation of the center”. 

3.3. Fourier Series. The derivation of the series in the preceding 
section was a trifle roundabout. Before proceeding with the direct 
derivation of Fourier series we shall briefly state enough theorems 
to build up the relevant background. 

Let f(t) be a periodic function with bounded variation and period 
2 t r; let it be integrable for all f, so that the products f(t) sin pt, 
f(t) cospfare also integrable. Define 

1 f 2 * 

clq = — I f(t)dt, 

2ir Jo 
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i r 2 ’ i r 2 ' 

a p = - I f(t) cos ptdt, b p = - I /(f) sinpfdf. 

7T JO TT JO 

The series 

(3.3.1) a 0 + X (a P cospt + b p sin pt) 

p- i 

is called the Fourier series of /(0- If /(f) is continuous, its sum is 
equal to f(t). Furthermore, if its derivative is bounded, then the 
Fourier series is uniformly convergent. 

If the form (3.3.1) is accepted, then the formula for the coefficients 
is very easily recovered by multiplying through by cos pt or sinpf 
and integrating from 0 to so that every term but one vanishes. 
If f(t) is an even function, then only the a p appear, and it is sufficient 
to integrate from 0 to tt, and divide by tt/ 2. Similarly, if f(t) is an 
odd function, only the b p appear. 

Using the exponential function, we could also put 

(3.3.2) /(f) = Z «pE ipt , 
where 


a P = 


1 

2tt 



dt . 


(Note that as soon as trigonometric functions are replaced by ex- 
ponential functions, the summations must go from minus infinity to 
plus infinity.) 

Consider the expansion of the function a/r as a Fourier series in 
the mean anomaly. It is an even function of E , and consequently of 
M. Also a/r = dE/dM. Therefore 


a 1 f a 2 ^ C a 

-=- I - dM H — > cos pM I -cos pMdM 

r tt Jo r Jo r 


ir 

7 r Jo 


2 

dE + - ^ cos pM I cos(pi? — pe sin E)dE. 

* p-i 
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Define the “Bessel’s coefficient” J p (x), of order p and argument x 
by 

(3.3.3) J P (x) = \ j 0 cos (p0 - x sin<t>)d<t>. 

Then we can write 

CL 00 

(3.3.4) - = 1 + 2 ]T J p (pe) cos pM. 

r P . 1 

These coefficients are ubiquitous, and it is necessary to break off 
and derive some of their properties before continuing to develop any 
other series. 

3.4. Properties of Bessel's Functions. J p (x) was defined in (3.3.3). 
But since 


1 

2 7T 



sin(p0 


x sin 4>)d<t> = 0, 


we could have written 

If 2 ' 

(3.4.1) J > {x)==1 tjo E ~ iP * EUSm * d <t>' 

Now suppose that we wanted to expand the function E lxsm * as 


4- <® 

= £ a p E ip *. 

Then, from (3.3.2) we would find that a p = J p , so that 
(3.4.2) E 1 ™* = ^ J P (*)E* 

P* 

a formula that can be useful, incidentally, where trigonometric func- 
tions of trigonometric functions are concerned. 

Now put E» = z, so that 2i sin</> = z - 1/z. Then (3.4.2) becomes 
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The left-hand side of (3.4.3) can be written as the product of 


i(\*) *■/«* and z ( - 1)* Q xj 

To find the coefficient of z p put a = (i + p. Now a cannot be negative, 
so that for p ^ 0, 


(3.4.4) J p ( x ) = £ ( - 1)« 

0=0 


+ P) 


(\ Y + » 

A-2 X ) 


For p < 0, the summation runs from j3 — — p, — p + 1, • • •. The 
series (3.4.4) is absolutely convergent for all x . 

In (3.4.3) change z to — z, and x to — x\ the left-hand side is the 
same, so that 

c7p<x) = (-l)V p (-*). 

Also, change z to — 1 jz. The left-hand side is still the same, so 
that 


J P (x) = ( - 1 YJ- P (x). 

Combining these two results, we find 

(3.4.5) J_ p ( - x) = J p (x). 

Differentiating (3.4.3) with respect to 2, and using (3.4.3) to re- 
move the exponential on the left-hand side, we get 

s*(l + 1/2 2 ) Z J P {x)z p = Z P J p (x)z p -\ 

P — — OD P = OQ 

So that from the coefficients of z p ~ l , we find 

(3.4.6) !*[«/,_!(*) + J p+l (x)] = pj p (x). 

Similarly, differentiating (3.4.3) with respect to x , and considering 
the coefficients of z p , we can find 

(3-4.7) | [Jp-M - J p+1 (x)]= J' p (x). 

Differentiating (3.4.7) with respect to jc, we have 
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j;(x) = M-M 


-J'p+rix)} 


= 7 [ J P - 2 (x) - 2 J p (x) + J p+2 (x) ] (from (3.4.7)) 

4 

= \ [ - x (P - l)J P - M - J P (x) - 2 J p (x) + 

+ - (p + 1) J P+ i(x) - J p (x) ] (from (3.4.6)) 

= — e/p(*) + — [ (P — l)«/p-l(*) + (P + l)tfp+lW] 

= - J p (x) + ^2 Jp(x) - ^ J'p(x) (from (3.4.7)). 

So J p is a solution of the equation 

y" + ^ y' + (1 - p 2 /x 2 )y = 0. 

The general theory of Bessel’s functions can start from this equa- 
tion; but this is not needed for our purpose. We need only the 
solutions of the first kind, with integral values of p, and the defini- 
tion given above is sufficient. 

The series (3.4.4) demonstrates that the J p can always be calcu- 
lated. But there are many alternative methods of calculation, using 
such devices as recurrence relations, or continued fractions. See [ 1 ] - 
3.5. Applications of BesseVs Functions. Consider the expansion of 
sin mE. It is an odd function of E or A/, so that 


2 _ 

sin mE = - 2J sin pM 

A" p— 1 


JT- 


sin mE sin pM dM . 


Now sin pM dM — — (l/p)d(cospM), so that, introducing E , we can 
write 


sin 


mE = - 


2 ^ sin pM 


'* 1 P 

and, integrating by parts, 


J sin mE d[ cos(pi? — pe sin E) ] , 
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„ 2 A sin pM (T “I- 

sin mE — - - X — - — { sin mE cos(pE — pe sin E) 

ir p=i P (L Jo 

Jo m COS cos (pE — pe sin E) dfij . 

The integrated term vanishes at the limits; using the formula for 
the product of two cosines, the integrand can be developed to give 

J ^ | cos [ (p + m)E — pe sin E] 

+ cos[ (p - m)E — pe sin £]} dE 


„ m " sin pM 
sin mE = ~2_ — — 

* P = i P 


A sin pM , 

= — — { «/p—G») + Jp +m (pe) }. 


P = 1 p 

When m = 1, we have, by (3.4.6), 


(3.5.1) 

Similarly, we find 


. „ 2 A sinpAf 

sin E = - £ — J p ( pe ) . 


e p-i P 


2 “ r * 

cos mE = a 0 H — cos PM J ^ cos mE cos pM dM 

. 2A _ _ C* m ■ 

— a ° + - 2^ cos pM - sin mE sin(pl? — pe sin E) dE 


p= i 


(after integration by parts) 


A cos pM / 

Go + m 2J — - — { J p _ m (pe) - Jp +m (pe) . 
p=i P 


Here 

a 0 


if" i r f 

= - I cos mE dM - - I cos mE (1 - e cosE) dE 

7 T 7T «/0 

= xX [ cosm£ ~ 2 ec os( m + l)-E- ^ecoslm - l)fij 

= 1 if m = 0; — e/2 if m = 1; 0 if m > 1. 


dE 
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In particular, using (3.4.7), 

(3.5.2) cos E = - ]■ e + 2 £ C ° SP — </p(pe). 

z p 

We now have enough formulas to expand quite a lot of functions 
as Fourier series in the mean anomaly. For instance, Kepler’s equa- 
tion combined with (3.5.1) will cope with E . r/a can be expanded 
using (2.1.3) and (3.5.2). X and Y of (2.1.2) can be found similarly. 
Sometimes a little ingenuity can help; in seeing, for instance, that 
X/r 3 = - a~ 3 d 2 X/dM\ Y/r 3 = - a~ 3 d 2 Y/dM 2 . Another example 
is 

sin/= y/ — e 2 )sinl£/(l — ecosJE) = cot<p * 

cos f is easily found from (2.1.1) and (3.3.4). A function such as (r/a) 2 
can be easily written down in terms of a Fourier series in E , and 
from there to one in M. And so on. Many more examples are given 
in [ l] and [4]. 

It should be noted that these Fourier series are valid for any value 
of the eccentricity; but if they are re-arranged as power series in the 
eccentricity, then the upper limit noted in §3.1 applies. 

In the series for a/r or r/a or powers of these, it is noticeable that 
the lowest power of e in any coefficient is equal to the multiple of M 
in that term; this fact is a great help when deciding where to trun- 
cate a series. Although the equality just pointed out is not general, 
the fact that the lowest order of e increases as the coefficient of M is; 
this is a characteristic of these expansions stressed by D’Alembert, 
which now bears his name. In the expansion of ( a/r) ±k times the 
sine or cosine of m times the eccentric or true anomaly, the lowest 
power of e is in general equal to the coefficient of M minus m. 

3.6. Postscript. The reader should be warned that the above notes 
are extremely incomplete. No mention has been made of expansions 
in powers of the time; nor of the first-order differences between two 
“nearly equal” elliptic orbits. These fall usually under the heading of 
“orbit determination” and are dealt with more than adequately in 
[3]. Nothing has been said about the proper choice of units, even 
though, without this, an attempt at practical calculation in celestial 


ELLIPTIC MOTION 


31 


mechanics may be stillborn. Also hyperbolic orbits have been 
neglected, in spite of their increasing importance. Bearing in mind 
these and other omissions, the reader should consult some of the 
references. 
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Matrix Methods 


Consider the system of n first order differential equations 

(1) dXi/ dt = fi{X i, X 2 , • • • » X n ; t ) , i ~ 1, 2, ■ • • , n, 

relating the n coordinates X ( and the time t. These can be written 
symbolically in the condensed form 

(1A) X' = f(X,f), 

where X and f are column matrices; the prime represents differ- 
entiation with respect to the time. 

Suppose that a solution X fi (f) has been found, having initial 
conditions 

X*(t 0 ) = x 0 . 

A “slightly different” solution, X fi + 5X, might be found by solving 
equations (1) again, subject to initial conditions X 0 + 5X 0 at t 0 . Then 
hX would be found by subtracting X R . But this approach can be 
extravagant in significant figures, and it is often better to solve 
directly for 5X. 

If the squares and products of small quantities are neglected, 
then <5X must satisfy the first variational equations of the system (1) : 
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5X( 


df 1 /dX l dfJdX 2 • 

■ ■ dfJdX n 


«X, 

bX ' 2 


3 f 2 / d X\ d f 2 / d X 2 • 

■ ■ df 2 /dX n 


«Xj 

jX' n _ 


Jfn/dX x dfJdX 2 ■ 

■ ■ dfJ6X n 


5X„ 


or 

(2A) 5X' = A5X. 

The solution X/?(0 will be called the “reference orbit”. Each of 
the partial differential coefficients in the n-by-n matrix A is evalu- 
ated along the reference orbit, so that A is a known function of 
the time. 

Equations (2) are solved when any set of n linearly independent 
solutions is known. Finding these may present difficulties; but 
suppose for the moment that we have such a set, and that it con- 
sists of the separate columns of the matrix with elements 5Xy(f). 
Since any linear combination of these columns also gives a solution, 
the columns of 

(3) Q(t 0 ,t) = [5^(0] [fatf(td)]- 1 

must all be solutions. The matrix Q(t 0 , t) is equal to the identity 
matrix when t = £ 0 ; this provides the necessary initial conditions 
to find its components by numerical integration. For example, 
equations (2) would be solved subject to the initial conditions 
5Xi = 1, dXi = 0 (i ^ 1) to give the first column. The initial 
conditions for the second column would be dX 2 = 1 , dXi — 0 (i ^ 2); 
and so on. is called the “matrizant” (or “fundamental solu- 

tion matrix” or “state transition matrix”) of the system (2). Since 
each of its columns satisfies (2), it must itself satisfy 

(4) = Aft, 
where 

If the function 5X(t) were to have initial conditions 5X(£ 0 ) = 5X 0 , 
then the appropriate solution of (2) would be 
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(5) 5X(0 = Q(to,t)tiLo. 

It is clear that t) is the Jacobian matrix with components 
dXi/dXoj, etc. 

Consider the relations between residuals 5X at times t 0 ,t l9 and 
t 2 . We have 

5X 2 = {l(t 09 t 2 )5X 0 , 

and 

SX 2 = n(t u t 2 )5X x 

= nitutjQdoitJdXo. 

Therefore 

(6) litter = n(fi,*2)n(Mi), 

a result that is also evident from the fact that ft(t 0y t) is a Jacobian 
matrix. 

Consider the application of the matrizant to some situations in 
the context of astronautics. Suppose that a reference orbit has been 
calculated. If some maximum permissible error at time t\ is speci- 
fied, then the maximum permissible error at any earlier time t 0 
can be calculated if tt(t Xy t 0 ) is known. If an error is observed at 
t [h the effect at a later time t x can be calculated using ^)* But 
if t\ is fixed and t {) varies it is obviously inconvenient to solve the 
equations for il(t 0y t x ) many times for different t 0y and it is better 
to put 

(7) 5X l = ir l (t u t Q ) 6X 0 , 

and solve the corresponding equations with the initial conditions 
applied at t x . We notice, incidentally, that 12' _1 Ui, t 0 ) = 
Furthermore, it is possible to avoid the inversion of the matrix; 
for let 


T(M)ft(M) = /. 

Differentiating with respect to t, and using (4), we find 
(8) T' = — T A. 

Equation (8) is called the “adjoint equation” of (4). (The use of 
the adjoint equations in this sort of context was first cultivated 
in ballistics, and is described in [l ].) 
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Now suppose that equations ( 4 ) and (8) have both been solved, 
the initial conditions making each matrix equal to the identity 
matrix at time t x . Writing (6) as 

(6A) Q(t 0 , * 2 ) = fi« 1 , t 2 ) t 0 ) = n(t u t 2 ) T(f„ t 0 ), 

we see that the matrizant relating any two times can be found. 
Normally X will have six components, of position 



X ' 


'x t " 


-JC' 


x 4 ‘ 

r = 

y 

= 

x 2 

, and velocity x r = 

y 

= 

X5 


. z . 


1 

CO 

X 

1 


.z’. 


x 6 


Let the matrizant in ( 5 ) be subdivided into four three-by-three 
matrices: 


( 9 ) 



U(t 0 ,t) 
W{t 0 ,t) 


V(t 0 ,t)- 

Y (t 0 , t) _ ' 


Suppose that an error 5 r 0 is observed at t 0 , and it is required that 
after a thrust has been applied there will be a velocity residual 
or,', such that 5 r at time t is zero. Then we have 

(10) 5 Tq = — V~ 1 U 8 r 0 . 


Consider motion subject to a force function R. The differential 
equations of motion are 

= X 4 , X ' 2 = X 5 , X£ = X 6 , X ' 4 = dR/dX u 

X5 — dR/dX 2, Xg = dR/dX 3. 


The first variational equations can be written as t>\' = AiX, where 


( 11 ) 


A = 


0 

0 

0 

d 2 R/dXl 

d 2 R/dX x dX 2 

a 2 ft/ax 3 ax, 


0 0 100 

0 0 010 

0 0 001 

d 2 R/dX 1 dX 2 d^/dXjXz 0 0 0 
d 2 R/dXl d 2 R/dX 2 dX 3 0 0 0 
d 2 R/dX 3 dX 2 d 2 R/dX I 0 0 0 
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Then substituting from (9) and (11) into the equation & = Ati, 
we find 

U f = W, V = Y , W' = BU , Y' = BV, 

from which 

(12) U" = BU and V" = BV. 

Equations (12) are to be solved subject to the initial conditions 

U(t { ) 7 t 0 ) — /, VM) = 0 ; 

(13) 

U'(to,to) — 0 , V (to, to) — I. 

The columns of U and V are six linearly independent solutions 
of the equation 

(14) 6r" = B6 r, 

which is the first variational equation of the equation of motion 
in the form r ” = grad R. 

Let <5r 0 and 5ro be the initial increments in position and velocity 
to be applied to the reference orbit at time t 0 . Then at time t 

(15) 5r= U(t 0 ,t)i 5r 0 + V(t 0 ,t)hv' 0 . 

6r is a solution of (14), and since the components of <5r 0 and 5r ( ) 
can be considered as six independent, arbitrary constants, it is clear 
that (15) is the general solution of (14). 

The matrizant and its components should always be considered 
as functions of two variables (two independent times). Now consider 

Z(t 0 ,t) = ( U(T y t) dr. 

Jk) 

We have 

dZ/dt = 1 + { {dU(T,t)/dt)dr 

Jk) 

and 

d 2 Z/dt 2 = ( (S 2 U(t 7 t) / dt 2 ) dr 

Jk) 

= [ B(t)U(r y t)dT = BZ. 

Jk) 
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So Z satisfies the differential equation as well as the initial condi- 
tions for V, and must therefore be identical with V. Hence 

(16) U(t 0 ,t) = - dV(t 0 ,t)/dt 0 . 

So the matrizant (9) can be written 


(17) 

Let 


Q(t 0 ,t) = 




- dV/dt 0 V(t 0 ,0~l 

-d 2 V/dtdt 0 dV/dt J' 


n -1 (M) = T(«b, o = -J. 

Then from (8) and (11) we find 


U' 

V' 


U V 


0 / 

W' 

Y' 


W Y 


B 0 


Therefore 

V' = — U, Y' = -W, U' = - VB , W' = - YB . 

So 

(18) V” - VB, Y" = YB , 

where 

V«b,ti = 0, Y (£o, fo) = 1 > 

V'(Mo) = - /, Y'(Mo) =0. 

Now JB is symmetrical, so that transposing equations (18), we find 
(18A) V" T = BV T , Y" T = BY t . 

Comparing (18A) and (19) with (12) and (13), we see that 
V= - V T , U = Y t , W = - W T , and Y= U T . 

Therefore 


(19) 


( 20 ) 


JTHM) = 


U 

V 

-1 

* T - yT \ 

_ w 

Y _ 


1 

1 

** 

1 


a result that applies in general when the original system of equa- 
tions is canonical. 

The components of a matrizant would normally have to be 
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found numerically; but in some cases it is possible to find them 
analytically. This is notably so in the case of Keplerian motion, 
for which the components of V are given in a paper in [2]. This 
matrizant has possible applications in perturbation problems in 
celestial mechanics. The components are most easily found, not 
by solving the differential equation, but by considering, from 
first principles, what the effects of errors in velocity at time t Q 
will be on errors in position at time t. 

Consider the equation 

(21) bX' = AdX + g(0, 

in which a “forcing function”, g(£), has been added to (2). The 
equation is no longer homogeneous, and one way to solve it is to 
take the solution of the homogeneous part, viz., 

(22) bX(t) = Q(to,t)6X o, 

and allow the arbitrary constants, 5X 0 , to vary. Then 

5X' - Q'bX 0 + 126X6 
— Aii(5Xo 125Xq. 

Substituting this, and (5), into (21), we have 
A iiflXo + 126X6 = AHbX {) + g, 

so that 



The complete and general solution is therefore 

(23) 5X = 12(fc, t)8Xo + «(2 0 , t) ( V~ l (t 0 , r) g(r) dr, 

A 

where 5X 0 is once again constant. This is the exact solution of (21), 
subject to the initial conditions <5X(f 0 ) = 5X 0 ; no conditions about 
orders of magnitude are imposed. The first term, which includes 
the arbitrary constants, is the complementary function, and the 
second is the particular integral. If the particular integral is to 
be found numerically, probably the best procedure is to solve equa- 
tions (21) subject to the initial conditions 5X(£ 0 ) = 0. (23) can be 
simplified by the use of the multiplication formula (6) to give 
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(24) 


5X = n(M)«Xo + 



r) dr. 


Another form is 


(25) 8X = Q(t 0 ,t)8X 0 + Q(s,t) [ Q(r > s)g(r)dr, 

Jt o 

where s is an arbitrary time, chosen to make 12 as simple as possible. 

If the equations of motion are in cartesian coordinates, then 
the first three components of g are zero; writing the last three as 
f, we have, from (24) 

(26) dr = U (to, t) 8Tq V (to, t) 8Tq ( V(r, £) f(r) dr. 

0 

Or, from (25) and (20), 


(27) 


5r= t/(M)5r 0 + V(t 0 ,t)8 r' 0 

V- V t (s,t) 


+ 


[U(s,t) V(s, 0]f r v f(r) dr. 
A L U (s,t) J 


In the case of disturbed Keplerian motion, s would certainly be 
a time of perihelion passage. Also in this case there are advantages 
in changing the independent variable from the time to the eccentric 
anomaly in the reference orbit. 
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The Lagrange-Hamilton-Jacobi Mechanics 


I. Introduction. Consider a dynamical system specified by a set of 
generalized coordinates q = (qi) (i — 1,2 , * • - subject to m holo- 
nomic constraints, of the form 

(1) <t>(qJ) = 0, 

where 0 = (0,) O' = 1,2, Then m of the coordinates can be 

eliminated; the remaining coordinates number 

(2) n = N - m, 

which equals the number of degrees of freedom. Such a system is 
said to be holonomic. If the constraint equations (1) do not contain 
the time explicitly, we say the system is scleronomic , in contrast to a 
rheonomic system, with time-dependent contraints. 

Lemma 1. The kinetic energy is a quadratic function of the general- 
ized velocities q. 

Proof. Let the system be composed of N particles of mass m * and 
position vectors r k of the form 

r k = r k (qj); k= 1,2, • • -,N. 


(3) 
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Then 

(4) ? k = r k , t + i\ ?i (ji, 

(5) tl = r't + 2 • r km <7, + r Kqi ■ r k , qj q, q jt i,j = 1,2, •••,«, 

where the subscripts t and q denote the arguments of partial differ- 
entiation, and summation with respect to repeated indices i,j is 
understood. The substitution from (4) and (5) into the defining 
equation 

( 6 ) ' T=±m k ti 

leads to an expression of the form 

(7) T=±<tGq + bq+±c. 

Here G is the square matrix, 

(8) G = (m k r kq . ■ r kqj ), 
b is a column matrix, 

b = (m*r M • r Mi ), 

and c is a scalar, 

(9) c = m k r{,. 

The bar over a matrix will denote its transpose; thus b is a row 
matrix. The conclusion now follows by inspection of (7). 

Corollary. If the system is scleronomic, the kinetic energy is a pure 
quadratic function of the generalized velocities . 

Proof. Since t is absent in (3) , it follows that r ktt = 0, so that 6 = 0, 
c = 0, and 

(10) T=±qGq. 

Observe that: (1) G is the metric tensor of the configuration space ; 
i.e., the space of the coordinates q; (2) G is symmetric and positive- 
definite; i.e., 

(11) G= G, |G|>0. 

The differential of work dW can be written 


( 12 ) 


dW= Qdq, 
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where Q = (Q,) is the generalized force . We say the system is mono- 
genic if there exists a potential function of the form 

(13) V= V{q,q\t ), 
such that 

(14) Q=±Vq-V„. 

We note that if the potential is not velocity-dependent, then V(j = 0, 
and 

(15) Q = - V q ; 

i.e., the force equals the negative gradient of V in the configuration 
space. A system is said to be conservative if 

(16) V= V(q). 

Generally, we define the Lagrangian function L by 

(17) L(q,q,t) = T- V. 

An axiomatic foundation of analytical dynamics is furnished by 
the Hamilton Principle. The latter asserts that 

(18) sf*Ldt = 0 

on a dynamical path connecting two fixed points q(t x ) and q(t 2 ) in the 
(q } t) space. If the system is holonomic, it is then necessary that L 
satisfy the Euler equations of the calculus of variations; 

<19> £ 

Define the set of conjugate momenta p by 

(20) p = L,){q,<),t). 

The latter equation can be solved for (j to yield 

(21) q = q{q,p,t) 

provided 

(22) |£«l^0. 



THE LAGRANGE-HAMILTON-JACOBI MECHANICS 


43 


Lemma 2. If a system is scleronomic and conservative, then |L W | ^ 0. 
Proof. From (10), (11), and (16) we deduce 

L = \qGq- V(q), 

(23) L„ = Gq, 

L„=G, 

|LJ =|G|>0. 

The Hamiltonian H of a system is defined by 

(24) H{q,p, t)=pq- L(q, q, t), 

where q appearing in the right-hand member is to be expressed as a 
function of q,p,t by means of (21). 

Theorem 1. If the system is scleronomic, and if the potential is 
velocity independent, then the Hamiltonian equals the total energy E of 
the system. 

Proof. From (23.1) and (20) it follows that 

(25) P 

q = G~ l p. 

Here we recall that G is symmetric and that the transpose of the 
product of several matrices equals the product of the transposed 
matrices arranged in the reverse order. Therefore, with the aid of 
(10), we obtain T = \qGq= \pG~ x GG~ l p, leading to 

(26) T=±pG~ l p. 

Finally, (24) and (17) imply H = 2 T — L and 

(27) H = T+ V=E. 

A set of 2ra variables (q,p) is said to be canonical if it satisfies the 
canonical equations of Hamilton, 

(28) q = H p , p = - H q . 

We shall show that (28) is implied by (18). In the original formula- 
tion of the Hamilton Principle, the end points were fixed in the con- 
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figuration space. We shall now modify the Principle by fixing the 
end points in the phase-space of ( q,p ). 

Theorem 2. The Hamilton Principle implies the canonical equations 
of motion . 

Proof. In view of (24), the integrand of (18) can be written as 

(29) L = pq — H(q,p,t), 

and can be regarded as a function of (q,p,t) and the derivatives 
( q,p ). Then the hypothesis requires that L(q,p,q,p,t) satisfy the 
Euler equations in the form 

(30) tL ,- L „ 

From (20) and (29), we deduce 

(31) Lq = p, L q = — H qy L p = 0, L p = q — H P y 

which together with (30) implies (28). 

Theorem 3. If the Hamiltonian does not contain the time explicitly, 
then it is a constant of the motion . 

Proof. With the aid of (28), 

ti=H t + jti v + pH p 

(32) = H t +H p H q -H q H p 

= H t , 

since the inner product of the vectors H q and H p is commutative. The 
hypothesis H t = 0 then implies H = 0 and H = const. 

Let the state of a system in the phase-space be specified by a 2n- 
vector 

( 3 3 , x- Q, 

and let D be the corresponding 2n-vector differential operator. We 
introduce the canonical matrix 4> 0 represented by 
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where 0 denotes the n X n null-matrix, and / the n X n identity 
matrix, and note the following identities: 

(35) 4*0 = — 4>o = 4>o \ 4*0 — — I, 1 4>o | = 1; 

i.e., 4> 0 is skew-symmetric and orthogonal. With the definitions (33) 
and (34), the canonical equations (28) assume the form 

(36) x = 4> 0 H x = <t>o DH . 

The extended phase-space of 2n + 2 dimensions includes the conju- 
gate variables 

(37) q 0 ^t, p 0 = — H. 

With the extended Hamiltonian, defined by 

(38) iT^H + Po, 
the canonical equations, 

(39) <7o = isC Po = - i& % , 

are, indeed, satisfied. The first one is equivalent to the trivial identi- 
ty 1 = 1; the second is equivalent to H = H t , which is equation (32). 
The invariant imbedding described in this paragraph eliminates the 
time from the Hamiltonian, and reduces t to the role of a parameter. 

II. Transformation of variables. Consider the transformation x^x' 
defined by 

(40) x' = x'(x). 

If the transformation should involve the time, the latter could be 
eliminated by the imbedding construction of §1. Let J denote the 
Jacobian matrix of the transformation, 

(41) J = x' z = { dxi/dxj ) , i,j = 1, 2, • • •, 2 re. 

The following properties of the Jacobian will be used: 

(1) d = dxD, where d is the total differential operator; 

(2) J= Dx'; 

(3) dx' = Jdx ; 

(4) a c = a b b c ; 

(5) a a = /; 

(6) a b = (b a )~ L ; 

(7) D' = J~ l D. 
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The proofs are quite elementary and will be omitted here. In the 
language of tensor analysis, we say that dx is a contravariant vector, 
whose transformation is described by the matrix J in (3) , while D is 
a covariant vector, transforming by the matrix J 1 in (7). The two 
matrices so related that the transpose of one equals the inverse of 
the other are said to be mutually contragredient. 

Theorem 4. If J is the Jacobian matrix of the transformation x f (x) 
on a canonical set x with the Hamiltonian H, then the equations of 
motion in terms of x' are 


(42) 

x' = 

where 


(43) 

^ = J <$>„ 1 


and the scalar invariant H is expressed as a function of x' . 

The proof involves the relations x' = Jx and D — JD f implied 
by the properties (3) and (7) respectively, and the use of the canoni- 
cal equation (36). 

The 2 n X 2 n matrix 4> defined by (43) is named after Poisson; its 
elements 4>y are the Poisson brackets, denoted by the symbol (i,j) 
in the present chapter. 

III. Canonical transformation. A transformation is said to be canon- 
ical if it preserves the canonical form of the equations of motion. 

Theorem 5. A transformation with the Jacobian J is canonical if 
and only if 

(44) e/4> oe7= 4> 0 . 

The proof follows from (36), (42), and (43). The Jacobi relation 

(44) is equivalent to the following: 

QqQp-QpQi =0, 

(45) QqPp~QpPi = L 

PqPp - PpPq=0 . 

The proof involves the block representations (34) and 
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with the use of the rule of matrix block-multiplication. 

Another form of (44) is 

(47) 4>oe7= J 1 4»o> 

which is equivalent to 

Qq Pp'y Qp — Qp'i 

Pq Pq'y Pp Qq'» 

Theorem 6. For any function S(q,p\t) with | ^0, the trans- 
formation (q,p) — » {q',p') defined by 

(49) P = S q , q' = S P ', H' = H + S t 

is a canonical transformation. 

Proof. Let (g,p) be canonical variables satisfying (28) with the 
Hamiltonian H. Define L and L f by 

Ldt^pdq- Hdt, 

L' dt = p'dq' - H' dt. 

Then it follows with the aid of (49) that 

Ldt-L'dt= S q dq + S p , dp' + S t dt - d(p' q') 

(51) =dS-d(p'q') 

= dWy 

where 


(52) w^S-p'q 

Hence 


(53) 



(L~L')dt=dW 


<2 


*1 


= 0 , 


since W is a function of q y p\ t and since the end points are assumed 
fixed in phase- space. On the other hand, 


(54) if* Ldt = 0 

in consequence of (28) . Hence & ftp L'dt = 0, or 
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(55) S J^ 2 (p' q' - H')dt = 0, 
which implies 

( 56 ) q f = H' py P'= -Hf 

by the argument of Theorem 2. Provided |Sqp | ^ 0, the first equa- 
tion of (49) can be solved for p' and the result substituted into the 
second equation. Then the transformation is specified by the equa- 
tions of the form q ' = q'(q,p,t) and p' = p'(<7,p,0- 

Corollary 1. The transformation is canonical if 

(57) pdq - Hdt- ( p ' dq' - H ' dt) = dW y 
where dW is a total differential . 

Corollary 2. The transformation is canonical if H' = H and 

(58) pdq - p' dq ' = 0. 

Of the 2 n arguments of S, n are old variables and n are new. Alto- 
gether four forms of S are distinguished, which are related to W as 
indicated below: 


(1) W =S(q,q'), 

P Sq, 

p' = - s, 

(2) W = S(q,p') — p' q', 

t3 

II 

II 

(3) W= -S(q',p)+pq, 

<7 = S p , 

P' ~ 

(4) W = S(p,p') -p'q'+pq, 

>0 

II 

1 

q' = Sy. 


IV. Some algebraic properties of a canonical transformation. A trans- 
formation such that q ' is a function of q only and p' a function of p 
only is termed an extended point transformation . 

Theorem 7. An extended point transformation is canonical if and 
only if (1) it is linear , and (2) the coordinate and the momentum trans- 
formations are described by mutually contragredient matrices. 

Proof. Since q' — q'{q) and p' = p'(p), it follows from (48.2) and 
property (6) of §11 that 

(60) q' q = (p'p) 1 = A, 

where the matrix A is independent of q and p. 
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Corollary 1 . A linear transformation of the form 

(61) q' = Aq , p' = Bp 

is canonical if and only if the matrices A and B satisfy the relation 

(62) A = B~\ 

In the proof we note that 

(63) q' q = A, p' p = B. 

Then (62) follows from (60). 

Corollary 2. If the coordinates are multiplied by a constant factor X, 
while the momenta are divided by the same factor , the transformation is 
canonical . 

The proof follows from (62), which is clearly satisfied by 

(64) A = \I, B=I/\. 

Theorem 8. The set of all canonical transformations is a group. 

In the proof, the properties of closure, associativity, the existence 
of the identity element, and the existence of the inverse are estab- 
lished with the aid of (44) , the rules of matrix algebra, and the prop- 
erties of the Jacobian, numbered 4-6 in §11. A useful identity, 

(65) J 4\) J — $ 0 , 

is derived from (44) by premultiplying the latter by 4> 0 J' 1 and post- 
multiplying by 4> 0 e7. 

Theorem 9. The Jacobian determinant of a canonical transforma- 
tion equals unity. 

Proof. Since |4> 0 | = 1 , it follows from (44) that \J\ — ± 1 . The 
negative sign is then ruled out by two circumstances: (1) the group 
of transformations is continuous; (2) for the identity transformation 
x' = x the relations J = I and | J\ = + 1 hold. 

V. The Hamilton-Jacobi equation. 

Theorem 10. If the generating function S{q f p',t) of a canonical 
transformation is so chosen that the new Hamiltonian H' is identically 
zero , then the new coordinates q' and the new momenta p' are constants 
of the motion , and S satisfies the Hamilton-Jacobi partial differential 
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equation 

( 66 ) H(q,S v t)+S t = 0 . 

Proof. The first part of the conclusion follows from (28) with all 
quantities primed and H f s= 0; (66) is a direct consequence of (49.3). 

Theorem 11. If the Hamilton- Jacobi equation (66) has a solution 
S(q, a, t) with a a set of n constants, then the functions q(t) determined 
by the equations S a = 0, with 0 another set of n constants, and the func- 
tion p(t) determined by the equations p = S q , are the coordinates and 
the momenta respectively in the solution of the dynamical problem with 
the Hamiltonian H(q,p,t). 

Proof. Let the new variables be defined by 

(67) p f = a , q f = 0 . 

By Theorem 6, S(q, a,t) is then the generating function of the 
canonical transformation (q,p) — > (0,a) with the new Hamiltonian 
H' = H + S t = 0, vanishing identically in virtue of (66). Since a and 
0 trivially satisfy the canonical equations with H' = 0, they are 
canonical variables. Furthermore, (0,a) — > (q,p)> being the inverse 
of a canonical transformation, is also a canonical transformation by 
Theorem 8. Therefore, (<?,p) are canonical variables with the Hamil- 
tonian //. If a and 0 are so chosen as to satisfy the initial conditions 
q(0) = q 0 and p(0) = p 0 , the solution of the dynamical problem is 
unique. 

Thus, if one succeeds in solving the Hamilton- Jacobi equation to 
obtain a function S(q,a,t), he has in effect performed a canonical 
transformation ( q,p ) — > (0,a) with H' = 0. In the new variables, the 
phase path shrinks to a point a = const. 0 = const. The old variables 
are then expressed with the aid of the function S in the form q(a , 0, t) 
and p{a 9 0,t). The detailed procedure is summarized below: 

(1) Construct the Hamiltonian H{q,p, t) of the system and set up 
the Hamilton- Jacobi equation (66). 

(2) Find a solution S(q,a,t) of (66) containing n arbitrary con- 
stants a. 

(3) Write the kinematic equations of motion S a = 0 , where 0 is 
another set of n arbitrary constants. 

(4) Solve the latter equations for q , obtaining q = q(a,0,t). 

(5) Determine the constants of integration a>0 from the initial 
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conditions q(0), <j(0). 

The Hamilton-Jacobi equation can be solved exactly if it is separ- 
able. 

VI. The separable case. A system is said to be partially separable 
if a solution of Hamilton-Jacobi equation can be represented by 

(68) S(q,t) = W(q) + J/ (*), 

where W is a function of q only, called the Hamilton Principal Func- 
tion, and a function of t only. 

Theorem 12. A conservative scleronomic system is partially separa- 
ble. 

Proof. Since t is absent in the Hamiltonian, (66) assumes the form 

(69) H(q,S q )+S t = 0, 
which separates into 

^ H(q, W q ) = a x = const., 

OT~ f 

y = - <x i 

by means of the substitution (68). Then (70.2) leads to = — a x t 
and 

(71) S=W(q,a)-a x t, 

where W is to be obtained as a solution of (70.1) with n arbitrary 
constants a. 

A coordinate is said to be ignorable if it does not appear explicitly 
in the Hamiltonian. By the argument of the proof of Theorem 12, 
such a coordinate contributes to S a linear term In Theorem 
12, the role of ignorable coordinate is assumed by the time. Another 
example is furnished by the azimuth coordinate 4> of a particle in an 
axi- symmetric potential field. 

We note that a x in (69) is the total energy by Theorem 1. The 
transformation equations are p = W qy and 

= Pi + t, 

(72) 

Wa 2 = 


A system is said to be completely separable if a solution of the 
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Hamilton- Jacobi equation can be represented in the form 

0 

(73) 

Qo = t . 

Then p = S q leads to 

(74) S(=Pi(qi,a ) 9 (i = 0, 1, 

so that each momentum is a function of the conjugate coordinate 
and the n separation constants a. In particular, for i = 0, So = Po = 
a x = — H by the argument of Theorem 12. This result is in accord 
with (37). In the process of solution, the Hamilton Jacobi equation 
splits into n + 1 ordinary differential equations (74). With the aid of 
(73), S is obtained by quadrature, 

(75) ( Pi(QifOt)dqi= f ( pdq-Hdt ). 

i=o J J 

In view of (29) and (18), the latter expression can be identified with 
the action , defined by 

(76) S = min Ldt. 

A useful criterion for separability is furnished by the Staekel Con- 
dition: In an orthogonal coordinate system q> there exists a non- 
singular matrix 4>y(^) and a vector ^(<?J with £, j =1,2, «-*,n 
such that 


(77) 


n 


1 

(*" l )u= 1 /fo, 


where are the elements of the diagonal matrix G, representing 
the metric tensor of the configuration space. 


Theorem 13. If a conservative scleronomic system satisfies the 
Staekel Condition , then the system is separable. 

Proof. In view of Theorem 1 and (26), the Hamiltonian can be 
written 
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(78) H= ±pG~\q)p+ V(q). 

The Hamilton-Jacobi equation (70.1) then becomes 

(79) i W q G- l (q)W q + V(q) = «,. 

The substitution 

(80) W-'ZWtoi) 

1 

converts (79) into 

(81) (W!) a /g u + 

We shall show that if the functions are defined by 

(82) (W/) 2 = - 2 fc + !>#«>, i,j = 1 , 2 , • • .,n, 

j 

where a; are the separation constants, then (81) is identically satis- 
fied. Let a and b denote the n- vectors defined by 

(83) a= ((W') 2 ), b=(g- 1 ). 

In the matrix notation (81) and (82) then become 

\ba + V=a lf 

(84) 

a = — 2\p + 2<t >a. 

By the hypothesis (77), 

(85) V, ( 1,0 = 

Finally, the elimination of a and b reduces (84.1) to ( 1 , 0 , • ■ = 

ai, which is readily seen to be an identity. 

Examples of a Staekel system for a particle are furnished by the 
following potentials: 

( 1 ) V = — 1 /r, where r is a spherical coordinate. 

( 2 ) V = — 1 /r + 3 J 2 [ci(cos 2 0 - c 2 )/2 r 2 -f c 3 /r -f- c 4 /r 3 ], 

where r, 0 , 4> are spherical coordinates and ^, 02 , 03 , c 4 are disposable 
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parameters. In the artificial satellite theory, c 3 = 0 was used by 
Sterne in [ 8 ], and c 4 = 0 by Garfinkel in [ 9 ], 

(3) V = -£/(£ 2 + i? 2 ), where £, t/, </> are spheroidal coordinates, 
was used in the artificial satellite theory by Vinti in [12]. 

(4) The most general separable potential in spherical coordinates 
is of the form 

( 86 ) V = fi(r) + f 2 (0) /r 2 + /,(*) /r 2 sin 2 6. 

The Staekel Condition is satisfied with 


/ 


(87) 


= 


ti = fit 
1 -r " 2 

0 1 

0 0 


esc 2 0 


VII. Conditional periodicity. For a Staekel system, (74) and (71) 
imply 


( 88 ) pi = W!(qi,a); ( i = 1,2, •••,«)- 

In view of (82), it then follows that 


(89) 


Pi = - 2 ti(Qi) + Z %(Qi)^j 

j 


= Fi(q i9 a). 


Thus the phase- path is decomposed into n two-dimensional curves 
of the form 


(90) p t =± VWfaa)). 

If 0 , the motion is real and can be classified as circulation or li - 
bration . 

A coordinate is said to circulate if it is an angle and if F* is 
bounded from above and below by positive constants. Depending 
on the initial conditions, the phase-path is one of the two branches 
of (90). Since G is a diagonal matrix, (25.1) reduces to 

(91) Pi = giiQi 

with gu > 0. Therefore, if <fc( 0) > 0 then <7,(f) > 0 and Pi(t) > 0 for 
all t , so that < 7 , increases monotonically from <?(0) to 00 . If Fi(q) is a 
periodic function, then p(q) is also periodic with a period 2 tt . As qi 
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increases by 2 tt, the coordinate is said to go through a cycle. 

A coordinate <7, is said to librate if there exist constants a and b 
such that 

(92) F ‘ >0 

Fi = 0 if <7, = a or q, = b, a s; <7,(0) ^ 6. 

It then follows from (91) and the continuity of <7, that </,(0 is 
“trapped” in the interval (a, 6), traversing it back and forth while 
the phase-point describes a closed loop p,(g,), symmetric about the 
<7-axis. Each closed loop constitutes a cycle of the coordinate. 

A simple pendulum librates or circulates, according as E < 2mgl 
or E > 2mgl. If E = 2mgl, the phase-path is traversed in infinite 
time, and the orbit is said to be asymptotic. 

The phase-integrals of the motion are defined by 


(93) 


1 , - § p, 


( <li,ct)dqi , (i = 1,2, •••,n), 


where the integral is taken over a cycle. Clearly, the n - vector J is 
a function of the ra-vector a only; i.e., 

(94) J = J{a) 


and is therefore a constant of the motion. Let the Principal Func- 
tion, written as W(q,J), be the generator of the canonical transfor- 
mation (q,p) — > (w, J) with H' = H = const. Then 


(95) p = W q , w = Wj. 
Since j = 0, w is ignorable, and 

(96) H = H(J). 


Indeed, the canonical equations are 

(97) j = 0, w = Hj, 

whose solution is of the form 


(98) 


J = const., 
w= pt + 8, 


where v and 5 are constants. The conjugate variables J and w are 
called the action-variables and the angle-variables respectively, and 
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v is the set of the fundamental frequencies of the motion. If (71) is 
now written in the form 

(99) S=W(q,J)-Ht, 

and differentiated with respect to J , it then follows with the aid of 
(95) and (98) that 

( 100 ) * = H Jt 5 = Sj. 

Thus the fundamental frequencies of the motion are equal to the 
partial derivatives of the Hamiltonian with respect to the action- 
variables. The result (98) is a special case of the following theorem. 

Theorem 14. If all the coordinates are ignorable then the conjugate 
momenta are constants of the motion and the coordinates are linear func- 
tions of the time. The theorem is valid if the words f coordinates and 
"momenta” are interchanged. 

We shall next examine the relation between q and w. A function 
f(x) is said to be quasi -periodic if it is a sum of a periodic and a 
linear function of x ; i.e., 

f{x) = rx + g(x ), 

(101) g(x+P)=g(x). 

Such a function is characterized by its secular rate r and its angular 
frequency u = 2n/P. 

Theorem 15. Each (librating / circulating) coordinate q is a {peri- 
odic / quasi -per iodic) function of the angle variables w. The period in 
Wi equals unity , and the secular rate equals 2tt. 

Proof. First consider the case where all the coordinates librate. 
By (95), both w and p are functions of q and the constants J . Hence 

(102) dw — w q dq = W Jq dq = W qJ dq = (pdq)j. 

Let the number of cycles traversed by q lie between m and m + 1, 
where m is a /i-vector composed of integers. In view of (93) , the 
integration of (102) over m cycles contributes to Au; the expression 

(mJ)j= m\ 


(103) 

the entire Aw is therefore 
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J r. <7(0 

pjdq < m + 1, 

9(0) 

where the integral is taken over a “fraction” of a cycle. If q goes 
exactly through m cycles, then q(t) = g(0), and 

(105) Aw = ra, 

and conversely. If a coordinate circulates, then it advances by 2?r 
each cycle. In the analysis, q{0) is replaced by 27rm, without affecting 
the conclusion (105). Therefore q, can be expressed in terms of the w’s 
in the form 

(106) <?, = 2irw,c + ^ Aj l) exp [ 2n \/ ( — l)(jiW l +j 2 w 2 + •••)], 

where juj 2 , • --,j n are integers, and c assumes the value 0 if q, librates, 
and the value 1 if it circulates. 

In view of (98) two types of motion are distinguished: 

(1) If vi are commensurable, then q(t) is a periodic function of the 
time, and the orbit is said to be periodic . 

(2) If vi are incommensurable, then q(t) is not a periodic function 
of the time, and the orbit is said to be conditionally-periodic . 

It is to be noted that (105) cannot be satisfied in the actual motion 
if vi are incommensurable. However, this circumstance does not 
affect the proof of Theorem 15, which deals only with the mathe- 
matical properties of the function q(w). 

Theorem 16. In a conditionally-periodic system the mean frequency 
of the coordinate q t averaged over an infinite number of cycles is equal 
to the fundamental frequency v, of the motion. 

Proof. The set defined by 

(107) & = vilv, (i = l,2,... f n) 

contains at least one irrational number. By a theorem of Dirichlet, 
the system of inequalities 

(108) |f,-- mi/N\ <Ar 1 ' 1/ " 

has an infinite number of integer solutions for N and m t . The division 
of (104) by the time interval T defined by v\ T = N yields with the 
aid of (98) 
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(109) 

and 



pj-dq 


= m i -m i I <N~ lln , 


( 110 ) 


m 

V - j,<ViN 


- 1 — 1/ n 


Therefore the mean frequency v m is given by 
(111) v % = lim m/T = v. 


We shall redefine J and w by means of 


( 112 ) 



Wi = 2i r(vit + 5i), 


— Til t (T,. 

That the new variables are canonically conjugate follows from Corol- 
lary 2 of Theorem 7. The new w^t) are angles described with the 
angular frequencies n the initial values u; ; (0) = <r t are the phase- 
constants. 


VIII. Application to the Kepler problem. The Kepler two-body 
problem is a simple example of a separable system. In the spherical 
coordinate system (r, 0, <f>) with 0 the complement of the polar angle, 
for a particle of unit mass, 


(113) 


V= 

T= |(/‘ 2 +r 2 0 2 + r 2 cos 2 0* 2 ), 


where n is the product of the gravitational constant and the mass of 
the central body. From (17) and (20), 

(114) Pi = t y p 2 = r 2 d\ p 3 = r 2 cos 2 00. 

By Theorem 1, 

(115) H = | (p\ + pi/r 2 + pi/ r 2 cos 2 0) - n/r, 


the Hamilton -Jacobi equation is 
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(116) (Sf + S 2 ,/r 2 + S 2 J r 2 cos 2 d)/2 - M /r + S, = 0. 


Since the Staekel Condition is satisfied in virtue of (86), let 


(117) S = S 0 (t ) + Si(r) -j- S 2 (d) + S 3 (<t>). 

Then, 

So = — <*i. 


(118) 


S( = (2<x x + 2 #t/r - a 2 /r 2 ) 112 = p u 
Si = («!-«! sec 2 6 ) 1/2 = p 2 , 

S3 = «3 = Pi, 


where the a’s are the separation constants. (The constants a 2 and a 3 
of (82) have been replaced here by ^af and |«|.) The integration 
of (118) yields 


(119) 


S = — <*it + a 30 + 


j'pidr + Jp 2 dff. 


From Theorem 11 follow the kinematical equations. 


J dr /p t = fa + t, 

( 120 ) “ <*2 fdr/ r 2 Pi + <* 2 fd6/p 2 = 0 2 , 

<t> — « 3 J' sec 2 0de/p 2 = 

For a bounded orbit, a x < 0, the integrals are evaluated as follows: 
Define new constants a, e , i, a, cu, 12; n by the equations 

«i = — m/ 2a, = M a(l - e 2 ) ^ cost = a 3 /a 2 , 

01 = v/fty 02 = W, 0 3 — 12, 

a 3 n 2 = M , 

and the uniformizing variables E, v, /, ^ by 

r = a(l — ecosl?), 
r = p/(l + e cosy), 
l = E — e sin £, 
sinfl = sini sin^. 


(122) 
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Then, 

(123) dr = ae sin EdE, —d(l/r) = e sin vdv/p, 
and 

/, = f dr/p, 

Jimnr 

(124) = J'r(/ 1 a) _1/2 [ (1 + e- r/a)(r/a - 1 + e)]' 1/2 dr 
= - (E — e sin E) = l/n. 



(125) = -a 2 j\(n/a) [a(l + e)/r — 1] [ 1 - a(l - e)/r ] ) 1/2 d(l/r) 


(126) 


= o, 


= a 2 d0/p 2 — J^a 2 (a‘i 

- f ( sin 2 


« 3 sec 2 0) 1/2 dd 


i — sin 2 0) 1/2 cf(sin0) 




(127) 


/ 4 = J^a 3 sec 2 0(a* 2 — <* 3 sec 2 0 ) 1/2 c/0 = cos i J^sec 2 dd\p 
= cos i Jd\p/(l — sin 2 / sin 2 ^) = tan _1 (cos/ tarn/0. 


The equations of the orbit now appear in the form: 

E — esinE = nt + a, r = a(l — ecos£), 



^ = v + w, sin 0 = sin i sin \p, 
<f> = il+ tan -1 (cost tan^). 


( 128 ) 
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It is easy to show that (1) the orbit is a plane curve; (2) the 
plane of the orbit is specified by its inclination i and the longitude 
of the ascending node ft; (3) v is the angle made by the radius vector 
with a fixed line in the plane; (4) hence the orbit is an ellipse, the 
fixed line is the line of apses , to is the argument of the pericenter, 
and v is the true anomaly ; (5) E is the eccentric anomaly , l the mean 
anomaly , n the mean motion; (6) \p is the argument of latitude. 

IX. The orbital elements. The elements of a Kepler orbit are the 
six quantities by which the orbit can be specified. We shall discuss 
six types of such elements. 

1. The Jacobi elements are the constants a and 0 introduced in 
the solution of Hamilton-Jacobi equation: 

a i — the energy 

a 2 — the angular momentum 

a 3 — the axial component of the angular momentum 
0! — minus the time of the pericenter passage 
0 2 — the argument of the pericenter 
/? 3 — the longitude of the ascending node 

2. The Kepler elements are the constants a, e, i, a, co, ft introduced 
in §VIII: 

a = — m/2«i, the semi-axis; 

e = [ 1 + 2 aial/n 2 ] 112 , the eccentricity; 

i = cos ~ 1 (a 3 /a 2 ), the inclination; 

a = pin = ( — 2«i ) 3/2 /3i/fi, the mean anomaly at t = 0; 

co = 02 , the argument of the pericenter; 

ft = 03 , the longitude of the ascending node. 

3. The action and angle variables are the J’s and the vf s defined 
in (112). In terms of the as, we calculate: 

J\ = ^<j)(2ai + 2 ti/r — al/r 2 ) l/2 dr = — 2a l )~ ll<1 - a 2 , 

(129) J 2 = — afsec 2 0) 1/2 do — a 2 — as, 



The integration in the first two lines of (129) has been carried out by 
means of partial differentiation with respect to the a 9 s. Indeed, with 
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J 


lai 



E= 2r 


E = 0 


- = m( — 2«i) 3/2 , 


(130) 


c/la2 

^2a 2 

e/2a 3 


-E 1 ' 


-v 


v=2w 

v=0 

v=2x 


-V' 


i /=0 

^ = 2x 

* = 0 


= - 1 , 

1, 

= — 1. 


From the relation dJ = c/ a cfa we derive J = f J a da + c, and (129) 
follows immediately within the additive constant c. To show that 
c = 0, we note that (1) e = 0 implies f = 0, p x = 0, Ji = 0, 2«i a \ + y 2 
= 0; (2) i = 0 implies 6 = 0, p 2 = 0, J 2 = 0, a 2 = a 3 . 

The a s can be expressed in terms of the J’s: 


(131) 


X Ji = m( — 2 aj 1/2 = y/ (fid), 

l 

= “ 2 ^ 2 + ^2 + ^ 3 ) 2 J 

Ot 2 — J 2 + c/ 3 , 


«3 — «/ 3- 

In view of (100), (112), and the relation H — a u we calculate the 
fundamental angular frequencies rii and the phase- constants <r, from 
the equations 

(132) rii = au t , ^ = S Jt , (i = 1,2,3). 

The first equation yields 

(133) tii = n 2 — = v 2 (Ji J 2 + Ja) 3 = m H ~ 2ai) 3/2 = ti . 

Since the frequencies are commensurable, the motion is periodic; 
since they are equal, the motion is said to be degenerate. In view of 
identities (1) and (3) of §11, the second equation of (132) can be 
written in the form 
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(135) 


<*j = 


n 0 (0 
n 10. 
1 1, 


Since S a = /?, (134) then leads to 


oi = n P i = 0, 

(136) <r 2 — nffi = 0 4“ 

03 = + ft “t - ft = a + w -J- Q. 

Finally, the angle- variables w t are constructed with the aid of 


(112): 

U) X = Tit + a = /, 

(137) 

W2 — l -\- U), 

= l -j- a) 12. 


4. The canonical elements of Delaunay can be obtained from the 
action and angle- variables ( w , J) by means of a linear extended- point 
transformation. The latter will be generated by a function S of the 
form 


(138) S = q' Ap, 

where A is a constant matrix. By Theorem 11, the transformation 
defined by 


p' = Ap, q = Aq' 

is canonical. As a check, we note that the relations 
( 139 ) Pp = A, q' = A~ l 

indeed satisfy the hypothesis of Theorem 7. Let q = w, p = J, and 
choose A as 


(140) 


A = 


1 

0 

0 


Then, the contragredient of A is 


1 

1 

1 


\ 


/ 
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(141) A~ l = — 1 1 0 

\ 0 -1 1 / 

and the new variables are constructed from 


which yields 


q' = A„ \ p' = Ap, 


\J (mo) 


Delaunay adopted the notation L = yj (na), G = a 2 , H = a 3 , g = fe 
h = In terms of the Kepler elements, 

L = \/(fia), l = nt+ a, 

(144) G = W(l-e 2 ), g = <o, 


H = G cos i y 


h = il. 


Conversely, the relations 

w>\ = /, = L G, 

(145) w 2 = l + g, J 2 = G - H, 

^ ”b J.\ = H, 

express the action and angle-variables in terms of the elements of 
Delaunay. 

Since the letter H has been pre-empted by Delaunay, it has be- 
come customary in celestial mechanics to use the letter F for the 
Hamiltonian. Another convention is the change of sign of the Hamil- 
tonian, which is equivalent to the interchange of the coordinates and 
momenta. Thus 

F = — = m 2 /2L 2 , 

(146) 

L = F t = 0, Z = - F l = h 2 /L*= n. 

With the exception of /, the Delaunay elements are constants in the 
undisturbed motion. 

An alternate derivation of Delaunay elements is based on the 
generating function 
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(147) S~ — a\t ft( — 2«i) 1/2 ft + 

where the unprimed a and ft are the previously defined elements of 
Jacobi. Then a[ = and ft = *S n , leading to 

~ 2«i) _1/2 = V (^a) = L, 

(148) ft = — ^ + m( — 2aj) 3/2 ft = — t T ft / ft, 

ft = n(£ + ft) = nt + (7 = L 

The remaining elements are unchanged. Since the old Hamiltonian 
corresponding to the Jacobi elements is zero, the new Hamiltonian 
becomes 


(149) F' = F+S,= -« 1 = m 2 /2 L\ 


in agreement with (146). 

The “slow” Delaunay set, obtained by the replacement of l by the 
constant a, can be derived from S of (147) with the term — a x t 
omitted. The corresponding Hamiltonian is zero in the undisturbed 
motion, in contrast to (150), which belongs to the original, or the 
“fast” set. 

5. A modified Delaunay set is constructed by a canonical linear 
transformation of the original set. With S of the form (138), and 


(150) 


A = 


/ i 
i 
0 


0 0 

1 0 

- 1 1 


we first construct the contragredient matrix. 


(151) 


A~ l = 


I 111 
Oil 
v 0 0 1 


and then derive the new variables from (139) as 


(152) L, G — L, H — G; l + g + h, g + h,h. 

We recognize l + g + h as the mean longitude, and g -\- h as the longi- 
tude of the pericenter. It is noteworthy that 

G - L = 0(e 2 ) , 

H — G= 0(sin 2 i/2). 


( 153 ) 
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6. The Poincare elements, 

L, l + g + h, 

(154) \/(2(L - G)) sin (g + h) 9 y/(2 (L - G)) cos (g + h), 

y/(2 (G - H)) sin/i, y/(2 (G - H)) cos K 

are derived from the modified Delaunay elements by means of the 
transformation 

(155) q' t = V( — 2<?,) sin Pi, p( = y/( - 2^) cosp* (i = 2,3). 

That this transformation is canonical follows from the fact that the 
Jacobian determinant is + 1 in each of the three phase subspaces. 
Hence the Jacobi relation (45.2) is satisfied. 

Poincare elements are especially useful for orbits of small eccen- 
tricity and inclination. Indeed, 

q<i € sin w , P 2 ^ e cos w , 

(156) 

qs — sin (i/2) sin A, pi — sin (i/ 2) cosn, 
where w = g + h. 

X. The method of variation of constants. A nonseparable system 
with a small parameter k can be solved approximately. In planetary 
theory, k is the ratio of the mass of the disturbing planet to the 
mass of the Sun; in the lunar theory it is the ratio of the mean mo- 
tion of the Sun to that of the Moon; in the artificial satellite theory 
k is J 2 * the coefficient of the second zonal harmonic of the geo- 
potential. In all cases, the solution can be expressed as a power series 
in k . 

The method of variation of constants involves the following four 
steps: 

1. Choose a “reference” potential V 0 for which the Staekel Con- 
dition is satisfied and for which the “disturbing” potential V\ ^ V 
— V 0 is of order k . 

2. Solve the Hamilton- Jacobi equation 

(157) H 0 (q,S q ,t) +S t = 0, 

corresponding to V— V 0 , yielding S(<7,«;f); then = fi furnishes 
the solution q i0) (t;a y p) of the “unperturbed” problem. (Examples: 
the Kepler ellipse for Vo = — 1/r and the Garfinkel and the Vinti 
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“intermediaries” in the artificial satellite theory.) 

3. Let Hi = H — H 0 = O(k), where H is the Hamiltonian of the 
original problem, and let S above be the generating function of the 
canonical transformation (q,p) — ► (0,«). The new Hamiltonian is 

(158) H' = H + S,= H 0 + S t + Hi = H u 
and the new variables (j 8, a) satisfy 

(159) p = H u , a = - Hi,. 

In matrix form, 

(160) e = <t> 0 H u , e = Q . 

These are the equations of the variation of the elements. 

4. Solve the equations of variation by successive approximation, 
such as the method of Poisson and the method of Delaunay. An 
epitome of the entire procedure was given by T. E. Sterne: “In order 
to solve the exact problem approximately, we first solve an approx- 
imate problem exactly.” 

An independent derivation of (160) is contained in the following 
theorem. 

Theorem 17. If the solution of the problem with H = H 0 + Hi is 
expressed in the same form x = x m (t,e) as that of the unperturbed 
problem with H = H 0 , but with e no longer constant, then e satisfies 

(161) <f = <t>H u , 
where 

(162) 4> = e x <t> 0 <r x . 

Proof. From (36), 

(163) x = 4*0 H x , x(o> = H Vl . 

By the differentiation of x = x(t,e), 

(164) x = x t + x e e'- 
- Since x = x m corresponds to e = const., 

(165) 


X(0) = Xt- 
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Let D x and D e denote the gradient operators in the x-space and the 
e-space respectively. Then 

(166) x e e = $o (H - H 0 ) x = $qH u = $ 0 D X H Y . 

By properties (6) and (7) of §11 (x e )~ l = e x an ^ D x= e x D e - The 
conclusion now follows with the aid of (162). 

If e is a canonical set of constants, such as 



then <I> = 4> 0 . In accord with the convention of §IX the coordinates 
and the momenta are interchanged while the sign of the Hamiltonian 
is reversed. If the disturbing function R is defined by R = — H u then 
the canonical equations for the variation of constants take the form 

(167) d=R 0 , (1 = - R a , 


or 

(168) e=4>o Re, «=(“)• 

XI. On the Poisson brackets. The equations of variation for a non- 
canonical set of variables involve the Poisson matrix 4>. The calcula- 
tion of 4> is facilitated by the use of the following two theorems. 

Theorem 18. The Poisson matrix $>(e) belonging to a set of varia- 
bles e is a function of e only, and is independent of the reference canoni- 
cal variables x appearing in the definition (162). 

Proof. Let x and y be two sets of canonical variables, and let 
<I>(e| x) denote 4>(e) defined with respect to jc. For the canonical trans- 
formation y x, with J = x y , (162) and (44) imply 

(169) $(e\x) = e x <b 0 e~ x = e x (x y ^ 0 x y )e~ x . 

By the associative property of matrix multiplication, property (4) of 
§11, and the reversal rule for transposition , we obtain 

(170) 4>(e|x) = (e x * v )<M*v^c) = e y ^ 0 e y = 4>(e| y). 

Theorem 19. If the two 2n-vectors 
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have the following properties : (1) e is a canonical set; (2) e f is derived 
from e by the semi-identical transformation of the form 

(17D q' = q'(q), p' = p, 

then the Poisson matrix ${e') has the structure 


(172) 


where Q == q'. 


$ = 


0 q' 

Q 0 


The proof follows from (43) with 


(173) 


J = e' e 


Q 0 
0 I 


As an example, let e f be the Kepler set of elements and e the 
“slow” Delaunay set of §IX. The semi-identical transformation 
e—*e f is characterized by 


(174) 


I a L 0 




e L e G 


ol 


\ 0 k; in j 


The partial derivatives are obtained from 


a = L 2 /m, e = (1 - G 2 /L 2 ) m , cos i = H/G, 

(175) 

n = n 2 a 3 , 

leading to 

a L = 2L/fi = 2/na = (1,4), 

£l = (1 -e 2 )/na 2 e= (2,4), 

(176) e G = — (1 — e 2 ) l/2 /na 2 e = (2,5), 
i G = cot i/na 2 (l - e 2 ) 1/2 = (3,5), 
i H = - csci/na 2 (l -e 2 ) 1/2 = (3,6), 

the remaining derivatives being zero. 

With e replaced by e ' , (161) becomes the celebrated equations of 
Lagrange for the variation of the Kepler elements. Explicitly, 
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d = (1,4) R„ 
e = (2,4) R,+ {2,5) R„ 
i = (3,5)R u +(3,6)R 0 , 

C= - (1,4) R a — (2,4) R e , 

« = - (2, 5) ft, — (3, 5) Ri, 

Q = - (3, 6) R^ 

It should be noted that a enters R explicitly through r, as well as 
implicitly through / = nt + a, with n 2 a 3 = p. Therefore, 


(178) 


<r= - (1,4) (R a ) - (1,4) n a R,t — (2,4) R„ 
d = (1,4) ft„, 


where ( R a ) denotes explicit differentiation. Since n a a = rt, 

(179) <i + rtt= - (l,4)(ft 0 ) - (2,4)ft,. 

The appearance of < outside the trigonometric term can therefore be 
avoided if we use, instead of a, the element a. defined by 


(180) 




ndt + o m . 


Then <r. and ( R a ) replace <x and R a respectively throughout (177). 
As a result of this transformation, double quadrature is required to 
calculate the perturbation 5l. Indeed, 

= ^ bndt “j - 
— ( 3n/2a)5a , 


(181) 

and hence 
(182) 


bl = 

6n 

6a 


-/ 


(2 Rjna)dt, 


<~jr 


{SRja 2 )dtdt + bo.. 


Another form of the Lagrange equations uses the modified Kepler 
elements a, e, i, c , w, 12, where ( is the mean longitude at t = 0, and w 
is the longitude of the pericenter, 


( 183 ) 


t = ff. + 07, 
w = a; + 12. 
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The effect of this transformation is to replace Q of (174) by 

'10 0' 

(184) Q' = Q l l 0 . 

i° 1 1 1 

The new Poisson brackets that are affected are: 

(2.4) ' = (2,4) + (2,5) = - (1 - e 2 ) ,/2 [l - (1 - e 2 ) ll2 ]/na 2 e, 

(185) (3,4)' = (3,5) + (3,6) = — tan^ i/na 2 (l - e 2 ) 1/2 , 

(3.5) ' = (3,4)', 
and (177) becomes 

d = (1,4) R., 

= (2,4)' R, + (2,5) R x , 

, 1QC , * = (3,4)'(f?, + flJ + (3,6)fl n , 

( 18 b) 

< = - (1,4) ft a - (2,4)' Re~ (3,4)' R„ 

*= - (2,5) R e — (3,5)' Ri, 
il= - (3,6) Ri. 

XII. On the Poisson method. The equations of variation of the 
elements are of the form 

(187) x = $DR = kf( x ,t); (&«1). 

The variables x and f are expanded into a Taylor series about the 
undisturbed orbit x = x (0> corresponding to k = 0. We have 

(188) f = f 0 + (f x ) 0 8x + -Sx ■ (fxx)o • &x + • • •, 
where 

; (189) 5x = X - JC 10 ' = kx w + k 2 x (2) + • • •. 

The two expansions are substituted into (187), and the coefficients 
| of like powers of k are equated, yielding 

i (0) = 0, 
x m = fo, 

i® = (fx)oX a) , 


( 190 ) 
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(191) 


je ( 0) = const. 



J f x (x m ,t)x a 'dt. 


for the perturbations of successive orders. 

Consider a problem of two degrees of freedom with R in the form 
of a Fourier series in the angle variables w with coefficients func- 
tions of the action variables J. Then 


(192) R= X ^hh («/)cosO'i n x + j 2 n z)t> 

0, - a» 

and jc p is of the form 

(193) x p = B 00 t + 'EB HI2 ex 1? [ V(- l)(jin l +j 2 n 2 )t}(j i n l +j 2 n 2 )- 1 
The perturbation terms can be classified as follows: 

(1) Secular term ZW, j\ = j 2 = 9, 

(2) Short- periodic terms, j\n x -\- j 2 n 2 = O(fcn). 

(3) Long-periodic terms, 7i^i+ j-z n z = 0(n), 

We observe that (1) only the angle variables have secular perturba- 
tions; (2) in the integration of the long-periodic terms there occurs 
a reduction of the order by unity; i.e., first-order effects appear in 
the expression for x (2 \ etc. 

In the artificial satellite theory, the long-periodic terms have the 
argument 2g — 2(w 2 — w { ) , which corresponds to j { + j? = 0 with 


(194) 


3 

U 2 — n \ — - M 2 Jz Wi(5 cos 2 i — 1) , 

4 

ksz J, - 10 


At resonance, i ^ 63°. 4, where the classical solution breaks down, 
special treatment is required. 

XIII. On the Delaunay method. Let the Hamiltonian F be of the 
form 


(195) 


F = Z Aij(L t (i) cos(i/ + jg). 
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As in the preceding section, the terms of F are classed as secular 
(i =7 = 0), the long-periodic ( i == 0, j ^ 0), and the short-periodic 
(i ^ 0) terms. Let the corresponding portions of F be denoted by the 
bar, the star, and the tilde; i.e. 

F = Aoo, F* = £ A oj cos jg, 

1 

(196) 

F= F-F- F*. 

Delaunay used a succession of canonical transformations to re- 
move, one by one, the periodic terms of F. In the limit the new 
Hamiltonian F' becomes purely secular; i.e. a function of L' and G f 
only. Then by Theorem 14, L' and G' are constants of the motion, 
and l\g f are linear functions of the time. With the generating func- 
tion S correlating the new and the old variables, the problem is 
solved. The feasibility of a single transformation to accomplish the 
same purpose was suggested by von Zeipel. It is often conveni- 
ent to remove first the terms of short period. 

XIV. On the von Zeipel method. The von Zeipel method (1916) 
will be illustrated here by a problem of two degrees of freedom. 
Without any loss of generality, we denote the canonical variables of 
the system by L, G, l, g, where l is the “fast” variable appearing in 
the short- periodic terms, and g is the “slow” variable that character- 
izes the terms of long period. Furthermore, let the undisturbed 
Hamiltonian F 0 be a function of L only. Then the undisturbed mean 
motion n is given by 

(197) n = - F 0L . 

We seek a transformation (L, G, l,g) — > (L', G\g r , h') such that the 
new Hamiltonian F' is a function of L ' , G' only. As noted in the 
previous section, the problem of motion is solved if we can determine 
F' and the generating function S(L', G', l,g). 

We write 


F — F 0 4- F i + F 2 + 

(198) F' =F^ + F[ + F' 2 + 

S = L'l + G'g + Si + S 2 + S* + * * • , 
where the numerical subscripts denote the order of magnitude with 
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respect to the small parameter of the system. The transformation 
equations are of the type (59.3) , and take the form 

L = Si = L' + Sn + S% + • • • , 

G = S g = G' + S\g + *S + • • • , 

(199) 

V — Si' = l + Sn' + *Sil' + * * * , 

g' = S(7 — g + §1(7 + SiV + • • • * 

The old Hamiltonian F is now expanded about L' , G f ,1, g by means 
of the Taylor series: 

Fq — [ F 0 + Fol(Su + S#) + — F 0LL Sh ] + • • * » 

(200) Fi = [F 1 + F 1L + F 1G (S W + S£) ] + * • • , 
F 2 -[F % ]+..: 

Here the expansion is carried out to the second order, and the 
bracket is used as the symbol of the substitution L— *L', G— »G'. 
Since S t = 0, (49) yields F' = F, and 

Fi + F{ + F 2 '+ ... = 

(201) _ _ 

[ Fq + Fi+ F 2 H - ^2 H“ Fl G^lg + *^2 i) ] > 

where <J> 2 is defined by 

(202) <t> 2 = F lL Su + F lG S u + - F 0 nS\i. 

Separation by orders of magnitude converts (201) into 
Fi = [F 0 ], 

(203) Fl = [F l -nS ll ] i 

F 2 — [F 2 +4>2+F igSJ — TlSy] . 

In our problem, the feasibility of the von Zeipel transformation 
requires that 

(204) F? = 0. 

If this requirement is satisfied, (203) can be separated by type of 
term; i.e., the secular, the long, and the short, leading to: 
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Ft = [F 0 ], 

^ = [^1 

(205) 0 =[F l -nS u ], 

F% = [ F 2 + $ 2 ]> 

0 = [ F* + $ 2 * + F 1G S^\ 

0= [F 2 -{- $2+ F 1G Sfg — nS2/] . 

Now^the six^ equations (205) determine the six unknowns Fq, F [ , 
F 2 , S u S *, S 2 . In particular, S is obtained from 

(206) S * = - [ f ( F 2 + 4*2*)^/F 1( ;J , 

£>2 = £ — J' (F 2 + $2 + Fi g*S J)d/ J . 

From the canonical equations in the new variables it follows that 

^ 207 ^ L' = const., V = - (F' L )t + /'(0), 

G' = const., g' = - (F' a )t + g'( 0). 

Finally, the generalized Kepler equations (199.3) and (199.4) are 
solved for l and g in terms of the constants I/, G' and of the linear 
functions g f of the time. 

The appearance of a small divisor F XG in the calculation of the 
long-periodic terms is clearly seen in (206.2). If this divisor vanishes, 
as in the problem of the Critical Inclination in the Artificial Satellite 
Theory, it becomes necessary to carry the Taylor expansion to the 
next higher term. 

Generally, the feasibility of the von Zeipel transformation is as- 
sured by the following Poincarb criterion , which is a generalization of 
(204): “The order of a periodic term in the Hamiltonian is at least 
one more than the order of the time derivative of its trigonometric 
argument.” 
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Stability and Small Oscillations 


about Equilibrium 
and Periodic Motions 



1441 ^ 


The general motion in a problem in celestial mechanics may often 
by exhibited as an oscillation or combination of oscillations about 
one or another of a number of configurations of absolute or relative 
equilibrium, or about a periodic motion (which may be one of a series 
of periodic motions). The periodic motion may be regarded as a 
generalisation of relative equilibrium, and variables may of course 
always be found in terms of which it appears as a relative equilibri- 
um configuration. The value of periodic motions as intermediate 
orbits is illustrated in Hill’s Lunar Theory [l] de Sitter’s theory of 
the great satellites of Jupiter [2], and in many theories of the mo- 
tion of minor planets and satellites involving commensurable or 
nearly commensurable periods (for example the Trojan group). 1 

Poincare’s treatment of periodic motions in general, and his ap- 
plication of it to the general motion problem of three bodies, leading 
to the proof of the existence of periodic solutions of three sorts in 
that problem, was the subject of lectures at the July, 1960, Summer 
Institute. 2 The first part of the present chapter deals with the 

^eee.g. E. W. Brown, Astronom. J. 35(1923), 69. Yale Trans. 3(1923), Parts 1 and 
3; D. Brouwer, Yale Trans. 6(1933), Part 7; W. M. Smart, Mem. Roy. Astronom. 
Soc. 62(1918), 79; and H. Hertz, Astronom. J. 50(1946), 121. 

See §§1,2 and 3 of the notes on Periodic orbits for the July 1960 Summer Institute 
in Dynamical Astronomy at Yale University, McDonnell Aircraft Corp., St. Louis, 
Missouri, 1961. 
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application of Poincare’s procedure to the main problem of the 
Lunar Theory, and the second part to an investigation of motion in 
the vicinity of a periodic solution or equilibrium configuration, using 
the equations of linear variation. 


M 



Figure 1 . Coordinates for Sun, Earth, Moon System 


I. Application of Poincare’s procedure to the lunar theory. 

a. Description of problem . The “Main Problem” of the moon’s 
motion is an idealisation in which we suppose the Sun, S, and the 
Earth, E, to revolve about their common centre of mass G with 
angular velocity n', and suppose the mass of the Moon, M to be so 
small that its effect on the motion of the other two bodies is negligi- 
ble. Then (see Figure 1) if r, r s and r £ are the position vectors of 
M, S and E respectively, referred to G, in a frame which is rotating 
with angular velocity n', the equation of motion of the Moon may be 
written 


(1.1) r'+2n'Xf + n'X (n' X r) = - Gmg(r 3 rfi) 


Gm s (t - fs) 
A 3 


where m s , m E are the masses of the Sun and Earth respectively, G 
is the constant of gravitation, and 

P = |r - r E | = EM, 

A = | r — r s \ = SM. 


We now use the position vector of the Moon relative to the Earth, 
p — r — r^. At the same time put p = Gm E and note that if a' is the 
radius of the Earth’s orbit about the Sun ( SE ), then 

Gm s = n rl a ' 3 


J 
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(I.2)p+2n' Xp + n'(n' p) - n ' 2 (r E + P ) = - !»- 

p 

where a' = r E - r s = SE = r £ very closely. Now 

A 2 = (a' + p ) 2 

= a' 2 + 2a' • p + p 2 . 


n'V 3 (p+a') 

A 3 


so 



and so equation (1.2) becomes, collecting terms, writing a' for r E , 
and neglecting p 2 in comparison with a' 2 , 

(1.3) p + 2n' X p + ■'(■' • p) = - + j^V + p) 

p a' 

(The latter approximation is equivalent to taking the limit as the 
Sun’s distance (a') tends to infinity while the mass also tends to 
infinity in such a way that the angular speed («') of the Earth about 
it remains constant.) Taking a system of axes with E as origin, axis 
of £ in the direction of SE, axis of in the plane of the Earth’s 
motion, and axis of f completing the right handed orthogonal set, 
we have 


n' = (0, 0, «'), a' = (a',0,0), p=(£,„,f), 
and so the equation (1.3) becomes 

k -2n'v-3n' 2 S = - *4 , 

P 

d.4) v + 2 n'* = -q, 

P 

t + n' 2 f = — 

P 

b. The circular solution , and motion in its vicinity . In the absence 
of the Sun, the last term of (1.2) does not appear, and also G coin- 
cides with E , so that ig ~ O. The equations are then 
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£ — 2 n f rj — rc /2 £ = 3“ , 

p 

• o in 1 

(1.5) >j + 2re'£ - n' v = 3 > 

p 

p 

which have the periodic solution 

( 1 . 6 ) £ — a cos vt, r/ = a sin vt, f =0 

where a is a constant and (v + n') 2 ci 3 — p. This represents an undis- 
turbed circular motion, in the plane of the Earth’s orbit. We will 
study possible motions in the vicinity of this circular orbit, in its 
plane, in the presence of the Sun under the conditions of equations 
(1.3) and (1.4). So we put 

( 1 . 7 ) £ = a cosi/£ + 5£, v = a sin vt + f = 0 - 

Then p 2 = £- + rj 2 — a 2 + 2 a(cosi^ 5 £ + sin^5r?) + 0(5£“, 5 tj 2 ) and 
equations (1.4) become, putting (with i = ( — 1 ) 1/2 ) 

w = 6 £(xi) + 1 S 77 , 2 = exp ivt, 

(1.8) w 4 - 2 in' w - n rl w - ^ (n' + v) 2 (w + 3z 2 u>) 

= ^n' 2 a(z+3z l ) + 0(w 2 ). 

In order to seek a complementary function, we try 

w = Atf 1 -T Bz 2 a , 

where A, B and a are constants to be determined. Substitution into 

( 1 . 8 ) leads to 

— | («*' + fi') 1 -f 2 d" ^ ) 2 | Az a — — (v n ) Az 

— | (2v — otv + n') 2 -\- ^ (^ + rc') 2 j Bz 2 

— |(„ + n') 2 Bz a = 0. 
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The coefficients of z" and z 2 “ must each vanish, and if A and B are 
not both zero, 

(av + n') 2 + - (u + n') 2 , | (v + n') 2 

= 0 

| O ' + ^') 2 , ( 2 * — a ? + n ') 2 + ^ 0 + rc ') 2 

or 

(1.9) (a — l) 2 (a 4- p)(a — 2 — p) =0, 
where p = n' jv. 

Corresponding to the root a = — p we have £ = —A/ 3, so that if 
A ~ P iQ, P and Q being real disposable constants, 

(1.10) W=(P+ iQ)z~ p - | (P - iQ)z 2+p , 

and a = 2 + p leads to nothing new. 

Corresponding to the double root a = 1 we try 

w = (C + inEt)z, 

which leads to 

|^ — — (v + n') 2 (C + C) — 2 v(v -T n')E 

— | 0 + n') 2 in(E — E)£ Jz = 0 

so that E must be real, and equal to — 3(1 +p)(C + C)/4. So if 
C = £ + iS, R and S being real disposable constants, 

E = — | (1 + p)R, 

and the corresponding part of the complementary function is 

( 1 . 1 1 ) w = (^R T iS — — in (1 -f- p) Rt^j z. 

For a particular solution we try 

w = A'z’ 1 + £'z 3 + (C' + intE')z, 
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( 1 . 12 ) 


(1.13) 


^ 3m 2 o(3p 2 + 14p + 19) 
16(p + 3)(p-l) 

= ft_ i, say, where m = n' / v, 

= - 9 m 2 a(p + l) 2 
16(p -(- 3)(p — 1) 

= b 3 , say, 


(1.14) 


4(1 + p) 
= c t> say, 


and we may take C' = 0. 

So the complete solution of the equation (1.8) is 
w=(P+ iQ)z- p -\(P- iQ)z 2+p 

(1-15) 

+ + iS + j^Cj - | (1 + p)R J ivt j z + 6_i2 _1 + 6 3 z 3 . 

c. The search fora periodic solution. Suppose that at t = 0 we have 
w = 0i + ip 2 and w = 0 3 + i0 4 , 
which corresponds to the general set of initial conditions 
(1.16) t = a + fiu 7} = p 2 , Z = 03 j v = av + 

This corresponds to 


01 — g P + R + 6-1 + 63, 

0 2 = |q + S, 

f} 3 = - | (1 -p) v Q-uS, 

04 — — g (1 2p)vP — —(1 + 3 p)vR 1^1 — i/ 6 _i -(- 3vb 3 . 


and 
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and 


P = 


3(1 + 3 p) , _ 3 

2(1 +p) Pl (1 + ph 


Q 

R 


2(1 +p) 

2(1 + 2 p ) 

1 +P 


fa 


fa 4 


2(1 + p)r 
2 

(l+p)*' 


fa. 


fa. 


S = 


(1-P) 
1 +P 


02 - 


(1 +P)v 


fa. 


where fa = 0i — 6_i — ft 3 and fa = 0 4 — vC\ + k6_ 1 — 3*»6 3 . 
Suppose now that at f = 2-k/v 4 r we have 

^ — 0i + 1 A 1 4~ ((02 4~ 1 P 2 ) 

and 


= 03 + 1^3 4- i(04 4- ^ 4 )- 

There is a relation between the ft, which arises from Jacobi’s integral 


(1.17) | (£ 2 4- v 2 ) - ^ 4- | n' 2 £ 2 = constant. 

Substituting the expression (1.7) gives 

i (a 2 */ 2 — 2av sin vtb% + 2ap cos^r;) 

— - + (cos vtb% + sin vtdij) 
a a 1 

Q 

+ - n' 2 (a 2 + 2a cosyf5£) 

+ 0(5£ 2 , 5t? 2 ) = constant, 

and consideration of the values at t — 0 and t — 2t t/v + r shows that 
a*/ sin + ap cos i>t^ 4 + (^ ~h a / ) 2 a(cos^r ^2 H“ sin^r^) 

+ 3n /2 a cospt^i = 0. 
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Therefore it is sufficient, in order to establish a periodic orbit, to 
satisfy 

(1.18) = ^2 = ^3 = 0, 

since = 0 is then assured. We may then take ft and r both to be 
zero, leaving ft, ft and ft to be determined. The equations to be 
solved are then 


fa = | (cos2t rp — 1 )P + \ sin27rp • Q = 0, 

3 3 

$2 = — | sin27rp • P + ^(cos27rp — 1)Q + 2*- j^c, — |(1 +p)J?J 
= 0 , 

and 

2 2 

= - 1/(1 — p) sin2irp • P — - *(1 — p)(cos27rp — 1)Q 

3 3 

— 2™ [ Cl — ^ (1 + p)# J “ 

which lead to 


P = Q — 0, and 
which in turn require that 

2ci 




2 c 1 

3(1 +p)’ 


ft = 0, ft = 


and 




(1 + 3p)ci 


3(1 +p) ” 3(1 +p) 

Then 5 = 0, and so the appropriate solution is (using p = m) 


w — 


2cj 

3(1 + p) 


z + b_ Y z 1 + fc 3 z 3 


(1.19) 


1 2 

= — - am z 
6 


^ am 2 z 1 + \ am 2 z 3 + 0(m 3 ). 
16 16 


The Jacobian determinant, which must not vanish if the expression 
for the ft appropriate to the periodic orbit are to be obtained as 
power series in m by the method indicated by Poincarft is 


\ d(ft,ft,ft) A=* = 0 


= — (1 — cos27rp). 

n 
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Correct to m 2 , the expressions for the coordinates are, then, 

£ = a ^1 — ~ m cos vt ^ am 2 cos3^, 

( 1 . 20 ) 

r) — a (^1 + 1 m2 ^j sin vt + ^ aml sin 3 vt. 

This orbit crosses the axis of £ at right angles, and is symmetric 
about it. It is, in fact, the intermediary orbit used by Hill in his 
Lunar Theory [ 1 ] . 

II. The equations of linear variation. 

a. General formulation. Suppose a dynamical system is described 
by means of variables x u x 2 , • • • , x n which satisfy equations of the 
type 

dx- 

(II I) -^ = Xi(x u x 2 , ■■•,x n ) (i = 1,2, 
which possess a periodic solution given by 

(H*2) *, = 0, W (i = 1,2, •••,«) 

which has period T. Then for motion in the vicinity of this solution 
(11*3) = ^(t)+£i (i = 1,2, •••,«) 

where the £, are small. Substitution in (II. 1) gives 

(ii -4) § =E^-u 1 (o ) ^(o,*--,^«))^+o(f 2 ) 

at j= i o£j 

or, to first order in the £,, 

(II.5) | = ZA ij(( )^ (i =1,2,*.., i») 

Clt j = 1 

where Ay(t) = (dXi/ d£ ; ) (<t>i(t) , </> 2 (0 , * * *,</>n(0)- These are called the 
‘‘equations of linear variation.” 

Now if £P (0, £i 2) (£),•• - ,£i n) (t) are a linearly independent set of 
solutions of the equations (II.5), any solution may be written in the 
form 


(II.6) 


?,(*) = (t = 1,2, ---,«) 
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where the £ 0) are appropriate constants. Now the fait) have period 
T, and therefore so does So if & = tf(t) is a solution of (II. 5), 

so also is & = f * ( T + 1 ) . In particular so is & = (P ( T+t ) which may 
therefore be written, as in (1 1. 6), 

(11.7) {,W (T + o = £ fi 0 ** £**> (0 (i = 1, 2, • • • , n) 

Jfe-1 

for appropriate 

We now seek a solution fc(f) which has the property that 

(11.8) ii(T+t) = (i= 1,2, • • *,n), for all t. 

Iffc(f) is written in the form 

ii (0 = Z (0 (i = 1, 2, . • • , b) 

y=i 

then this requires that 

£ ^ (T+t) = \ZB» #(t) (i = 1, 2, ■ • • , n) 

J-l ;=1 

or, using (1 1. 7), that 

£ £ W = x£ BWfcW «) (i = 1 , 2, • • B). 

j = 1 *=1 7=1 

The (i) are a linearly independent set, and so we must have 

n 

Z B ( *)£ (M = (j = 1,2, • -•,«). 

*=1 


Thus X must be an eigenvalue of the matrix that is a solution 

of the equation 

B ( u)_X, B u.2> > ... f B u.n) 

B (2,l) f B ( 2.2) _ X> ... ( B <2.»> 


(II.9) 






D(n.n) 
, * * * , *-* 


X 


and (B (1) ,B (2) , is a corresponding eigenvector. 
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If X is such an eigenvalue, then the value of a satisfying 

(11. 10) X- exp (ocT) 

and whose imaginary part, is in the range — tt/T ^ yf(a) 

^ 7 r/T, is called a “characteristic exponent” of the solution (II.2). 

If the n solutions of the equation (II.9) are different, then the n 
corresponding eigenvalues are linearly independent, and the n cor- 
responding solutions £, — £i(t) are linearly independent, and so form a 
complete set. So if the eigenvalues are X (1) ,X (2) , •**,X (,l) , the corre- 
sponding characteristic exponents are a (1) , a (2) , • • •,a tn) , and the cor- 
responding solutions are £l l) (t), £ 2) (f), respectively, then 

any solution may be written in the form 

n 

(11.11) {,(t)=EC*{]‘ ) W (i = 1,2, 

k=l 

where the C k are constants to be determined. Since ^ (0 satisfies 
(II. 8) if the corresponding characteristic exponent is a {k \ we have, 
in view of (II. 10), the result that the function 

(11.12) xi k) ( t ) = exp ( - «<*> t) H k) ( t ) 

for i = 1,2, is periodic with period T. Therefore the general 

solution (II.ll) of the equations (11.5) may be written in the form 

n 

(11.13) m = Z C k exp (a ik) t)xl k> ( t ) (i = 1, 2, . . . , n) 

where the x! k) (0 are periodic functions with the period T of the 
solution (II.2). 

If two or more of the solutions of equation (II. 9) are equal, then 
the form (11.13) does not usually give the general solution of the 
equations (II. 5). Suppose the two of the solutions are equal, so that 
there are n — 1 distinct eigenvalues, X (1) , X (2) , * -,X tn_1) , correspond- 
ing to which there are n — 1 linearly independent solutions, 
(0, £ 2) (t), • • -,£7 n_1) (t), respectively, of (II.5). These do not form 
a complete set, so we can find a solution, say £*(£), which is not a 
linear combination of them. Now the set £l l} ( t ), $ 2) (0, • ■ •, (0, 

£*(£) are linearly independent, n in number, and so form a complete 
set, and so the solution £*(T + t) may be written in terms of them as 

Z*(T + t) = £ 6*??’ ( t ) + 6„ £*(<)• 

k=l 


(H.14) 
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Now consider the solution 

n - 1 

di. 15 ) it) = z c k il k) c t ) + C'trv), 

k=i 

where the constants Ci,c 2 , • • - ,c„ are still to be determined. We have, 
since the solutions £l l) (t), •,£}'*“ 11 (t) satisfy (II.8), and from (11.14), 

(! n> (T+t)=Zc t X« £*> (t) + c J Z bkii k) V) + b m tf (t) ] 

*-i ' *-i ’ 

= z (c*X W + cA) i. W ( t ) + b n f £ n) (t) -Z c„& k) (t) } , 

*-l 1 *=1 ’ 

(using (11.15) again), i.e., 

(11.16) (T+t) = Z - b n ) + c n b k \Zi k) (t) + bjl n) (t). 

k=l 

Suppose, if possible, that b n is different from all of the X (A) . Then we 
may choose c k = c n /(b n — \ {k) ) (A = 1,2, - — 1), and (II.16) be- 

comes 

& n) (T + t) = b n &> (0, 

showing that (0 satisfies (II.8), and X = b n must be a solution of 
equation (II.9), contradicting the assumption that b n is different 
from all of the roots of that equation. Therefore b n must equal one 
of X (A) , and by appropriate enumeration of the latter we may take 

b n = X (nl) . 

Also we may take c k — c n b k / (\ {n 0 — X ( * } ), k = 1, 2, * • *, n — 2, and 

(11.16) becomes 

(11.17) §*' ( T+t)= cA-itf 1 " 1 * (t) + X ( "- 1) «> ) (t). 

Now if i = 0, the functions 

x! n) (t) = exp ( - a {n ~ l) t) Q n) (t) 

have period T, and the general solution has the form (11.13), as in 
the case where all the \ {k) are different, except that now a (n) = a (n_1) . 
If on the other hand b n _ x ^ 0, we put c n = l/6„_i, and the function 

x! n> (t) = exp ( - « ( - ,, f) { g n> (f) - ^rrrr H n (*)} 
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is periodic with the period T. The general solution of the equations 
(II.5) now has the form 3 

fc(0 = £C* exp («<*>*) x l k) (t) 

k=l 


(11.18) 


+ C n exp (a*-* t) { x! n> (t) + xl n - 1) (0 } 

(i = 1,2, 


In the case of a greater multiplicity of roots of (II. 9), suppose the 
root \ is) is repeated p + 1 times, then we shall find that the corre- 
sponding contribution to the general solution is 

exp (««t) [ C, x/ s) ( t ) + C s+1 x/ >+1) it) + • • - + C qX W(t) 


(11.19) 


+ c, +1 { x r i, w+^xi 9, (o} 


T \ (s> 

+ c q+2 { x! q+2> (0 + ~ x. J,+1) (0 


{ 


+ 


Tx (s) 

t(t — T) M 
2!(TX W ) 2 X ‘ K 


} + - 


+ c s+p { x r p) 


( t ) 


T\ (s) 


xi s+p+1> («) + ••• 


+ 


f(t- T)(t- 2T) ...(t-rT) 
(r+ 1)! (TX ,s, ) r+1 



where the functions (0 have period T, and q is some number 
which will lie in the range s g q ^ s + p, and r = s + p — q — 1. 

b. The case of constant coefficients. If we are investigating motion 
in the vicinity of a configuration of equilibrium, then the quantities 
of (II. 2) are constants, and so the coefficients Ay in the equa- 
tions (II.5) are also constants. 

Suppose that the eigenvalues of the matrix (Ay), that is, the roots 
x = x u », x <2) , - • - ,x in) of the equation 


g 

The proof previously given for this result does 
to Dr. Barrar for casting doubt on it. 


not cover all cases. I am grateful 
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(11.20) 


An — * » A 12 , • * • , 

A 21 , A 22 — x, • • • , 


Ai n 

A 2/1 


= 0 , 


A 


«1 > 




Ann JC 


are different. Then corresponding to each jc 0) we can find an eigen- 
vector (p lJy p 2 j, -'-,Pnj) such that 

n 

(11.21) £ A ik p k , = x w p tf (i,y = 1,2, •••,«). 


Then the n eigenvectors are linearly independent, and the matrix 
( Pij ) is consequently nonsingular. If (p; ; ) is its inverse matrix, define 
new variables rj u ? ? 2 , * • • , by 

n 

(H- 22 ) * = (I = 1,2, • ••,«). 

y-i 


Then they satisfy the equations, using (1 1. 5), 


dij, 

dt 



ft ft 

= ZI p.jA*£* 

n n n 

= 2 S PijAjkPum, 

i - 1 *-i i-i 

using the converse of (11.22). Then (11.21) shows that 


(11.23) 


d^i 

dt 


j-i /-i 

- r (,) „ 

- X T) l 


This has the solution 
(11.24) ru(t) = ^(0) exp (x (t) 0 
and therefore the solution of (1 1. 5) is given in the case by 




(I = 1.2, . 

■,n). 

1! 

h- ‘ 

JSO 
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(11.25) I, (t) = X PijVji 0) exp (x u 't) (i = 1, 2, • • •, n). 

i = i 

The eigenvalues x 0) of (A, 7 ) are therefore to be identified with the 
characteristic exponents, and the periodic functions xF> (t) are re- 
placed by the constants p*,. 

c. The equation satisfied by the characteristic exponents. In Poincare’s 
notation [3, §§37-39, 11, 40, 42, 43 and 46-48], we have 

&( 0) - ft ^ 

(11.26) and (i = 1,2, . --,n), 

Zi(T) = ft + ) 

and, since the are zero when the ft are, we may put 

(11.27) *.= Z(^-) &+(tf), 

*=1 \ c / ftfe / 0 

for a sufficiently small range of the ft, where the suffix “0” indicates 
that the ft have been put equal to zero. But if a is a characteristic 
exponent we know that the equations (1 1. 5) have a solution of the 
form 

(11.28) £,■(*) = eexp(a*)xi(0 ( i = 1,2, - .-,n), 

where e is a constant and the functions x» have period T. Making use 
of the relations (11.26), 

ft = exz(0) and 

(11.29) ft + \pi = € exp (aT)xi(T) 

= f exp (aT)xi(0) ( i = 1,2, ...,n). 

Therefore the equation (11.27) may be written 

fjexp (aT) - 1 }x.(0) = i (^) < x *(0) + 0(t 2 ) 
k=i \dPk/o 

so that, dividing by e and letting e tend to zero, 

(11.30) jexp(«T) - ljxi(0) = £ (~) x *(0) (i = 1,2, •••,«). 

k=i \opk / o 

Since the Xi(0) are not all zero, the quantity x = exp (aT) — 1 must 
be an eigenvalue of the matrix (d^/dfo) 0 , that is, a solution of the 
equation 
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Wi/ 0 

x ... 

’ Wo ’ ’ 

/#A 
Wn/ 0 


/<¥ 2 \ 
\<W o’ 

W -x... 

/ 


' IWo ’ ’ 

\Wo 

(11.31) 


W2/0 ’ ’ 

(**) - 
\d0n/o 

or 

Wl/o’ 



det j (^-) — x6y 

|W,/o y 

= 0 , 


where 6 iy - = 0 if i ^ 7 , and 1 if i = j. If the characteristic exponents are 
different they are all given by the n roots of this equation. 

d. Some properties of the characteristic exponents. If the equations 
of motion (II.l) are real, then real values of the ft lead to real values 
of the lpi, so that the (d^/dftt ) 0 are real, and if a root of equation 

(11.31) is complex, its complex conjugate is also a root. Therefore if a 
characteristic exponent is complex, its conjugate is also a character- 
istic exponent. 

If the variables are subjected to a linear transformation 

n 

(11.32) m = Z TytoSj (i= l,2,...,n) 

y- 1 

where the T tj (t) have period T, then if 

*i(0 ) = ft' , and vi(T) = (3- + (i = 1,2, -•-,«) 

we have 

n 

0 ! = o)ft 
>=1 

and 

n 

Pi + */ = Z + fc) (i = 1, 2 , • • n) 

1 


so that 
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(11.33) = Z Tiji 0) = n) 

j - 1 

and 

(11.34) ft=tn-( 0)|9/ (» = 1, 2, •..,») 

j = i 

where (!’,>) is the inverse matrix to ( T, ; ) . The equation satisfied 
by the quantity x' = exp («' T) — 1 , where a' is a characteristic expo- 
nent derived from the equations for dr^/dt, is 


(11.35) 


det 



= 0. 


But, using (11.33) and (11.34), 



(Wk\ 

W;/<> 


and since 


n 


= IT»( 0) 



^(0), 


«« 




= £ 


^*(0) ^,(0), 


equation (11.35) may be written 


or 


(11.36) 


since 


det | T ik (0) | - det 


det 


m -x' 
V dpi / o 


S/u 


■'8*i 


(djh\ - X ' 

Ua / 0 

det| T, ; | = 1 -s- det | T y | . 


det | r„( 0 ) | = 0 , 


= 0, 


The equation (11.36) is the same as the equation (11.31) whose roots 
are the quantities x derived from the characteristic exponents arising 
from the which are therefore identical with those arising from the 
7 j ; . Therefore the characteristic exponents are unaltered when a linear 
transformation of the type (11.32) is applied to the variables 
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If the variables jq of the equations (II.l) are subjected to a general 
transformation 

(11.37) = fi(x l9 x 2 , . . • , x.) (i = 1, 2, - - ■ , n) 

then the periodic solution (1 1.2) is described by the values y, 
-/<{*i(f) f fcW, •••,*»(*)} (i= 1,2, ...,n) of the y, The equations 
of linear variation for the y,- are obtained by putting 

y* — l(0, * • *» } + Vi (i ~ If 2, • * *> n ) 

corresponding, to first order in the to the motion described by 
(II.3), namely 

x i = (pi(t) + ^ (i = 1, 2, • • • , fi), 

so that 

n df 

(11.38) Z ^ Ui(O,---,0n(O}i>. 

; = 1 C,X ; 

The coefficients in this last equation have the period T of the peri- 
odic solution (1 1.2), and so this transformation is of the type (11.32), 
and leaves the characteristic exponents unaltered. Therefore the 
characteristic exponents associated with a given periodic solution of 
a dynamical problem are independent of the variables used. 

Suppose the equations (II.l) possess an integral of the form 

(11.39) F(x ly x 2t ...,*,) = C. 

Then in the motion (1 1.3) we must have 

F(<t> 1 + 01, • • • , <f> n + fin) “ F( 4 > 1 + fil + • * * > <Pn + fin + ^n) > 


and so, differentiating with respect to ft, 


£***‘-o 

£i a*; aft 


O’ = 1,2, •••,«). 


Thus there is a linear relation between the columns of the matrix 
unless the solution (1 1. 2) corresponds to a stationary value of F 
with respect to all the and so equation (11.31) has a zero root, 
and there is a zero characteristic exponent. It can be shown that 
there is a zero characteristic exponent corresponding to each inde- 
pendent integral of the type (11.39) [3, §65]. This result is true even 
if the functions X, of the equations (II.l) depend explicitly on the 
time (with respect to which they must then of course have the period 
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T, if the solution (II.2) is to be possible). 

If the functions Xi do not depend explicitly on the time, then a 
solution near to (IL2) is 

Xi=<t>i(t+dt) (i= 1,2,. 

where bt is an increment in the time. This corresponds to the solu- 
tion 


(11.40) (* = 1,2, •••,») 

of the equations of linear variation. This solution has period T, and 
so corresponds to a zero characteristic exponent, which must exist 
even in the absence of integrals of the type (11.39), and may be 
shown to be additional to the zero exponents arising from such 
integrals [3, §§66 and 67]. 

Again, if the solution (II.2) is a member of a one parameter fam- 
ily of such solutions, all of the same period, 

Xi = (o’, t) (i = 1,2, •••, 71 ) 

then another solution is 

X( = <f>i{(T bo, t) (i = 1, 2, • ■ n) 

corresponding to 

(11.41) = (i = 1,2, •••,/»), 


which is independent of (11.40), and therefore corresponds to 
another zero characteristic exponent. 

e. The case of a canonical system. Consider now a dynamical system 
whose equations of motion may be put in the canonical form 


(II 42) __ dy l 

dt dyi 7 dt 


dH 

dXi 


(i = 1,2 , .--,n) 


where H is a function of the variables x u y x \ x 2f y 2 ; ** *; x n ,y n , of the 
problem. The variables may be arranged in conjugate pairs. Let 


(11.43) y, = 0„+i(O (i = 1,2, •• - ,n) 


be a periodic solution, the functions <t>i{t ) having period T. Then the 
equations of linear variation are obtained by putting 
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(11.44) JC,- = <*>,(*) + &, yi = <t> n+ x{t) +i)i (i =1,2, •••,«) 

and retaining only the first order in and which gives 

(11.45) dt j-Adxjdyi 1 dyjdy t Vj ) (f = i t 2, ...,») 

dru = y [ d^H ) 

dt ~l \ dxj dXi ^ ^ dy t dx, Vj ) 

the second derivatives of H being evaluated by use of the expressions 
(11.43). Then if (&,*?,) and (£/,*?/) are two solutions of these equa- 
tions we may easily verify that 


X (€i^i ~~ £*' Vi) 

i=i 

is constant. 

If a and a' are two characteristic exponents of the solution then 
corresponding to them there exist solutions of the equations given by 


and 


£, - exp («*) x; (0 
m = exp (a*) (0 


(i = 1,2, 


£/ = exp (a' 0 x/ (0 

(/= 1,2, •■•,«) 

m— exp («' 0 X n 4 i (0 


where the functions x/(0 and x!(t) either have period T y or are 
finite polynomials in t with coefficients which have period T, as we 
have seen in §IIa. Then the quantity 


exp (a + a) £ j Xi(0xi+» (0 ~ Xi (Oxn+i (0 ! 

i=i 

is constant, and so either a — — a', or the constant is zero. That 
(a + a') could be equal to an integral nonzero multiple of 2ni/T is 
excluded since the a are chosen to have imaginary parts in the range 
— tt/T to + t r/T, and if a = a = 7T i/T y then one of them may be 
taken equal to — 7 n/T without altering the results of §IIa, and then 
we have «=—«'. 

Suppose now that q of the characteristic exponents have the same 
value a. Then there are q linearly independent solutions corre- 
sponding to them, say 

£i r) = exp (at) xl r) (0, 


r?i r) = exp (at) Xn+i(t), 
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where r takes the values 1,2, Consider now those solutions 

for which 

(11.46) E (fen.* - &*) = 0, 

1=1 

for r = 1,2, 1 and <7. There are q such relations, and so we 

may use them to reduce the system of 2 n equations (11.45) by elim- 
ination to a system of degree 2 n — q , which therefore has 2 n — q 
linearly independent solutions. But the system (11.45) has 2 n inde- 
pendent solutions altogether, and so therefore q of these have 

X (& *?i r) — £i r) w) ^ 0 

for some r, and therefore there are q independent solutions cor- 
responding to a characteristic exponent a = —a, so that q of the 
exponents have that value. Therefore, in a canonical system, to each 
characteristic exponent there corresponds an exponent equal in mag- 
nitude but opposite in sign. 

f. The stability of an equilibrium configuration or periodic motion. A 
formal definition of the stability of a configuration was given by 
Liapounov [4], who called a configuration “stable” if, given any 
open neighborhood of phase space containing the configuration, then 
we can find another open neighborhood such that all motions origin- 
ating within the second neighborhood always remain within the first 
one subsequently. Thus we may call the solution (1 1.2), namely 

*i = (t>i(t) (i = 1 , 2 , •••,/!) 

of the equations (II.l) stable if, given any positive number e, we can 
find another positive number 5 (in general depending on e) such that, 
for any motion x t — x ,(£) satisfying 

n 

E l*-( f o) - 4>i(t 0 ) | < 5 

i= 1 

for some time to, we have 

n 

E |*i(0 - < e 

1 = 1 

at all times t after t 0 . 

If the equations of linear variation described the equations of mo- 
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tion of the displaced motions (II.3) exactly, then their solution 
(11.13), (11.18) or (11.19) shows that the condition for stability is 
satisfied if all the characteristic exponents have negative real parts, 
or if they all have negative or zero real parts, those with zero real 
parts being distinct. If one at least has negative real part, and the 
others have zero real part, the displaced motion will in general tend 
asymptotically towards the solution (II. 2) as the time tends to infin- 
ity. In this case the solution (II. 2) is said to be “asymptotically 
stable”. In the case of dynamical systems whose equations may be 
written in the canonical form (11.42), which is the case if the system 
is holonomic and the forces are conservative, which therefore applies 
in the case of the motion of celestial bodies under gravitational 
forces, we have seen that the characteristic exponents occur in pairs 
with opposite sign. Thus in this case, for stability, they must all be 
pure imaginary or zero, and different. Then the general solution of 
the equations of linear variation is a sum of sines and cosines of the 
time, and the definition of stability is satisfied. 

In most cases, or course, the exact equations of motion of the 
displaced motion will have terms of higher degree in the & than the 
first, and the effect of these terms must be considered. Liapounov 
[4] showed that they cannot alter the fact that a solution which is 
shown by the equations of linear variation to be asymptotically sta- 
ble is in fact asymptotically stable. Where all the exponents have 
zero real parts, and are distinct, however, these higher terms may 
prevent the foregoing formal definition of stability from being satis- 
fied. Birkhoff [5, Chapter IV] showed that, in this case, neverthe- 
less, if s is any integer greater than unity, there exists a finite number 
K s such that, given any positive number t (not larger than some 
definite quantity), then in all motions satisfying 

n 

X I *.-(*») < f 


we have 

| <2* 

i = l 

for all times satisfying \t — t 0 \ < (1 /K s t s ), thus placing definite 
bounds on the rate at which the displaced motion departs from the 
solution (II.2). 
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In the case of periodic solutions, the above definition of stability 
is rarely satisfied even in the equations of linear variation; for exam- 
ple, in a one-parameter family of periodic solutions, the period usual- 
ly varies with the parameter, and so in a disturbed motion whose 
initial conditions place it on a neighboring periodic solution to the 
original one, the point in phase space representing the motion is 
carried steadily further away from the corresponding point in the 
original motion, because of the progressive increase of the difference 
in phase, although the paths may always lie close together. The 
longitude of a planet or satellite may depart linearly from the cor- 
responding value in the original motion, because of a small change in 
the mean motion arising from a small constant change in the major 
axis, without this constituting what we would wish to call an in- 
stability in the original motion. This shows itself in the term linear 
in time arising in the solution of the equations of linear variation 
when two of the characteristic exponents have equal values. We have 
seen that a periodic solution will usually have a number of zero 
characteristic exponents, which will usually give rise to such terms. 
We are therefore led to define the solution (II.2) as “orbitally 
stable” if any displaced motion, originating within an appropriate 
distance in phase space of some point in the original motion, has no 
point which is further than assigned distance from all points of the 
original path. Each case must be examined on its merits, to discover 
whether the powers of time arising in the solution of the equations 
of linear variation are allowable in this sense. 
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Lectures 

on Regularization 


1. Existence and uniqueness theorems of ordinary differential equa- 
tions. a. Single Equation. Consider the first order differential equa- 
tion 

(i.D W =f(t ’ x) 

where / is a real- valued function of the real variables t, x. Let (f 0 , x (l ) 
be a particular pair of values assigned to the variables ( t , x) . 

Picard-LindelOf Theorem. Hypothesis: 1. f(t, x) continuous in 

- t 0 | ^ a,\x - x 0 | ^ b; 

2. \f(t,x)\ < A 0 constant ) in /?; 

X 

R 

* (*o> *o) 


t 


Figure 1 . Domain for Solving (1.1) 
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3. \f(t,x l ) — f(t, x 2 ) | ^ M\x 2 ~ Xi\ in R, M = constant. 

Conclusion: There exists a unique function <p(t), defined for all 
values of t in S: \t — t 0 \ < h = min(a, b/A), such that x = <t>(t) satis- 
fies (1.1) and the initial condition, </>(£ 0 ) = x 0 . 

Note. Condition 3 of the Hypothesis is called a Lipschitz condition. 
Th;s is a fundamental existence and uniqueness theorem in the 
theory of ordinary differential equations. Proofs of this theorem and 
others are found in standard texts on differential equations, such as 
[1],[2], [3], [4], [5], [6], and [7]. 

One method of proof, known as the Method of Successive Approxi- 
mations, follows the Picard method of solution of the integral 
equation 

X(t) = X 0 + f /{ T, x(t) } dr, 

0 

whose solution satisfies (LI) and reduces to x 0 when t = t 0 . Picard 
proved that in the interval S the sequence of functions 

x (1) (£) = x 0 + /{ r, x 0 } dr, 

o 

( j 2) x(2> ( t ) = x ° + T ’ X<1> ( T ) I 


X ik) (t) = JC 0 + f{r,X (k 1 j (t) } dr, 

converges to the desired solution. 

The existence of the constant A in Condition 2 of the Hypothesis 
follows from Condition 1. Actually, however, the continuity of f(t, x) 
is not necessary for the existence of a solution. Ince points out 
(see [ 1, p. 66]) that Mie has shown that solutions exist when f(t,x) 
is continuous in x and Riemann-integrable in t. The Lipschitz condi- 
tion is required to prove uniqueness. The significant point of this 
theorem is the narrow interval, S , of convergence that it guarantees. 
The solution may converge throughout a larger interval, but no gen- 
eral method of determining the exact boundaries of the interval of 
convergence has yet been discovered (see [ l]). We say therefore that 
solutions of (1.1) hold in the “small,” in general. 
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Cfti the other hand, if (1.1) is a linear differential equation, the 
interval, of convergence of the solution is the interval of continui- 
ty of the coefficients. Specifically, if f{t,x) = ax + b, where a, b are 
continuous functions of t in the interval I = [t u t 2 ] , then the solution 
exists and is unique in / (see [I], [6]). Thus in the case of linear 
differential equations we say the solutions hold in the “large.” 

b. System of First-Order Equations . Consider now the system of 
first-order equations 

daci , ,, x 

—rr = fi(t,x u x 29 — , x„). 


(1.3) 


dx n 

dt 


= f n (t,x u x 2 , • • • > Xn ) . 


The extension of the Picard-Lindelof Theorem to this system is 
stated as follows: 

Hypothesis: 1. Each f iy i = 1,2, is continuous in its argu- 

ments in R: \t — t 0 \ ^ a, \x x — jc?| S b x , • • •, \x n — x° n \ ^ b n . 

2. !/,■(*,*!, •• •,*„)! < A in R , i = 1,2, • • -,n. 

3. | fi(t,X l9 ---,X n ) - fi(t,x u ---.x n ) | 

< K\ | A i — Xj | + * • • + Kn \ X n ~ X x\ i n R » i = 1 » 2, * * • j n. 

Conclusion: There exists a unique set of functions <j>i(t) defined for 
all values of t in S: \t — t 0 \ < h = min(a, b x / A,b 2 / A, • • -,6 rt /A), such 
that jc, — <f>i{t) satisfies (1.3) and the initial conditions , <t>i{t 0 ) = x?. 


The solutions are obtained by the recursion formula 


(1.4) 


(t) = x [ 0) + 


f /i|r l ir(r) f ...,I?’ u (r)| dr, 

J /n 

i- 1,2, 


•,rc, 


in extension of (1.2). 

Again, if (1.3) is a system of linear differential equations, i.e., il 

fi — an x x + ha m x n + 6 : , i=l,...,n, where the coefficients 

a iu • • • , a,„, and 6, are continuous functions of t in the interval / = 

[ t u t 2 ] , then the set of solutions exists and is unique in /, and we say 
the solutions hold in the “large.” 

c. Equations of Order Greater than One. Finally, since the differen- 
tial equation of order p, 
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(1.5) 


d p x 

dt p 


_ f (. dx d p 1 x\ 

f V’ X ’ dt dt p ~ l ) 


is equivalent to the set of p equations of the first order 

^2, 

dy 2 

( 1 . 6 ) 


dy i 
dt 


dt 


= y 3, 


^ = v 

dt yp ’ 


dyp 

dt 


= f(t,yi,y 2 , •••,y P ) 


where we define 


_ dx _ d p ~ l x 

y i-x,y 2 dt ’-"’ y p- dt p ~ i’ 


it follows that if f is continuous and satisfies a Lipschitz condition 
in a domain R of dimension p, the equation (1.5) admits in a domain 
S a unique solution which, together with its first p — 1 derivatives, 
will assume an arbitrary set of initial conditions for the initial 
value t — Jo- 
in the case of a linear differential equation of order p > 1, 

/A d p x , / .\ d p ~ l x dx 

(1.7) °° ** * 

+ Qp{t)x = b(t), 

the equivalent set of p equations of the first order is 


dy i 
dt 


= y 2, 


dyp-i 

dt 


= y p, 


dyp = b{t) a p (t) a„-i (0 MO 

dt a o (0 MO yi a 0 (0 " MO P ' 

It follows that if a 0 (0, MO, - • • ,a,,(0 and 6(0 are continuous 
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functions of t in / = [fi,J 2 ]> and a 0 (f) ^ 0 in /, then equation (1.7) 
admits of a unique solution in / which, together with its first p — 1 
derivatives, assumes the assigned initial conditions for the initial 
value t 0 in I (see [ l], [6]). 

d. Analytic Differential Equations. A large class of physical prob- 
lems are governed by equations of the form (1.1) where f(t,x) is 
analytic in its arguments t and x in a region R: \t — t 0 \ ^ a, 
|jc — jc 0 | ^ b , including the initial values of the variables. By this is 
meant f(t, x) has continuous partial derivatives of all orders and is 
uniquely expansible in a power series in {t — f 0 ) and (x — x 0 ) which 
converges for all values of jc and t in R. The power series is the 
Taylor series 


fit , x) = f(t 0 , x 0 ) + 


dx/to. 


(x - x 0 ) + M- 


> x o 


+ 


if (m , 

2! ( \()x 2 /((),jo 


(x - jc 0 ) 2 + 2 


(§f\ 

V dt/io,. 

f — 0 

Xdxdt/^xo 


it ~ to) 


[x - x 0 )(t - t 0 ) 


+ 




+ •••. 


The Cauchy Existence Theorem applies to these analytic differential 
equations. It states (see [ 5]): 

If fit,x) is analytic in its arguments in R: \t — f 0 | = \ x ~ = b, 

including the initial values of the variables, then the solution x = 
(pit) of (1.1) is analytic in t y i.e., can be represented by a convergent 
power series in it — t {) ), in a sufficiently small interval S about t {) . 

The coefficients Ci,c 2 , • • *,c n , * • • of the series solution 


x(0 — T c ] (/ — to) + C'lit — ^o) 1 T * * * 

are determined by replacing x on both sides of (1.1) by this series, 
and then equating coefficients of like powers of it — t 0 ) . 

Since Cauchy, others 1 have extended the lower limits for S within 
which the solutions certainly converge, and Moulton (see [5, p. 27]) 
shows how one such S is automatically enlarged if f does not contain 
t explicitly (the differential equation is then said to be autonomous). 
In many cases the interval of convergence of the solution is larger 
than that predicted from the general theory, but there is no general 


l See [5, p. 38] for specific references. 
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method of determining the exact interval of convergence of solutions 
of analytic nonlinear differential equations (see [5]), hence solutions 
of such equations hold in the “small.” For example, consider the ana- 
lytic nonlinear problem 


x(0) = 0. 

Here the maximum interval of convergence predicted by the exist- 
ence theorem is \t\ < 1/2. On the other hand, direct integration is 
possible, and gives the solution x = tan t, which has the interval of 
convergence tt/ 2, approximately three times the predicted interval. 
The danger of assuming a large interval of convergence in the general 
case, where the exact solution is not known, is obvious. 

As might be anticipated, the interval of convergence of solutions of 
analytic linear differential equations is the interval of convergence 
of the analytic coefficients (see [5]), and thus the solutions hold in 
the “large.” 

The Cauchy Theorem is easily extended to systems of analytic 
differential equations of order one and higher. 

e. Singularities and Regularization. A singular point of a differen- 
tial equation (1.1) is defined as a point ( t,x ) at which one or more 
of the conditions of the hypothesis of the existence theorem are not 
satisfied (see [l], [6]). Such points can be found by inspection of 
fit, x). 

Thus, at a singular point of the differential equation, the existence 
theorem does not apply, and other methods must be sought for 
establishing the existence and nature of the solution. One such 
method is called regularization. 

Regularization is essentially a substitution of an analytic (regular) 
problem for the singular one. It is applicable to equations with re- 
movable singularities and is accomplished by a transformation of 
variables which effect the removal of the singularity. Thus the solu- 
tion of the regularized equation is the solution of the original equa- 
tion. 

An elementary example will illustrate regularization. Consider the 
problem 

^ *2:0, x(0) = 0. 

at x 
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This autonomous equation is singular at x = 0, the initial point, 
so the existence theorem does not apply in the neighborhood of this 
point = 0, x = 0). Elsewhere the equation is analytic. 

Introduce a new independent variable r through the transforma- 
tion dt/dr = x . Since x ^ 0, Ms a monotone increasing function of 
r, and dt/dr — 0 if and only if x = 0 (t — 0). Hence, there is a one- 
to-one correspondence between t and r for all f ^ 0, r ^ r 0 , where 
t = r 0 corresponds to t — 0. The modified problem is 


since 



x(r 0 ) = 0, 
dx _ dx dt 

dr dt dr 


The solution, analytic for. all finite r, is 


x — r — r 0 . 

To express x as a function of the original variable £, first obtain r 
as a function of t from the equation dt/dr — x. 

Integrating, 


t 




(r — r 0 ) dr = (t — r 0 ) 2 / 2, 


from which 


T — T 0 = (20 1/2 , T ^ T 0 . 

Then the solution to the singular problem is 

x = (20 1/2 , t ^ 0, 

analytic for t > 0. 

In the next section we will discuss regularization of the differential 
equations of celestial mechanics. 

II. Regularization of the relative n-body equations of celestial 
mechanics — a formulation of the problem, a. Introduction. In a gravita- 
tional force field containing n-bodies of mass m i = 1,2 , • • •, n, the 
equations of motion, relative to a nonrotating coordinate system 
whose origin is mass m u are given by the system of n — 1 vector 
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equations (see [8]): 


<iu) £?-- 


k 2 M j 

3 

Pi 


Pi 


+ k 2 £ m\ ^ i=2,3, 

>=2;;Vi LPy pj J 


where 

M( — nti + 

k 2 is the gravitational constant, 

Py is the vector from mass nti to mass m y , 

Pi is the vector from the origin (mass rrii) to mass m„ 

Pj is the vector from the origin to mass m jy and the real variable t 
denotes the time. 

Except in the case n = 2, there are no known solutions of these 
equations which give qualitative information of the motion for all 
values of the time for arbitrary initial conditions. 

The Cauchy Theorem establishes the existence of series solutions 
of equations (I LI) in a neighborhood of every initial time t 0 except 
possibly at collisions of two or more masses, which represent singu- 
larities of the differential equations (see [9]). 

This suggests one approach to the problem— write the series solu- 
tion at a point a short distance from a singularity, and by means of 
the process of analytic continuation derive successive sets of other 
power series which converge for values of the time successively closer 
to the singularity, but never reaching it, of course. By its construc- 
tion, this approach precludes any survey of the general character of 
the motion near the collision. 

Another theoretical approach, whose literature is much less 
abundant, is that of regularization of the equations of motion, i.e., 
removal of singularities. This would establish the existence of solu- 
tions at the singularities and might make it possible to determine 
the character of the motion near collisions. 

Several investigators have succeeded in regularizing special cases 
of the rc-body problem. G. D. BirkhofF and Levi-Civita have used 
conformal mapping techniques to remove singularities of the re- 
stricted 3-body equations. Levi-Civita removed one singularity by a 
clever choice of mapping, and Birkhoff succeeded in completely 
regularizing the restricted 3-body equations by another mapping 
(see [10]). Birkhoff used the new formulation to describe what he 
called the “states of motion,” i.e., the position and velocity vectors 
of a particle at an instant. 
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Sundman in [ 11] regularized the general 3-body equations, except 
in the case of triple collision, by a change of the independent vari- 
able involving explicitly the position coordinates of the bodies. 
Sundman proved the existence of series solutions convergent for all 
values of time. Unfortunately, however, this is not of great practi- 
cal value in studying the overall motion, because the particular series 
converges slowly for large time intervals (see [9], [12]). 

b. Transformed Equations . Restricting motion to the invariable 
plane of the solar system, with mass m i the sun, Equations (II.l) 
have the same form, but can now be written in complex notation as 



d 2 Zi k 2 Mi 

dt 2 ~ 1 2,- 1 3 Z ' 




(H.2) 

n 

+ k 2 X m i 

>=2;;Vi 

( Zj Zj) Zj ~j 

-\^j z.r’ kT’-l’ 

i — 2, 3, * - 


where 

z = 

*(0 + iy(t). 


♦ i i 


Now consider the conformal map f of the z-plane (invariable 
plane) into the w-plane, and denote by g the inverse of /. Further, 
denote by the real variable r the trajectory time parameter in the 
w-plane. Under this simultaneous conformal map and time transfor- 
mation the equations (II. 2) become 

d 2 Wi g"(Wi) /dwtf __ d 2 t/dr 2 dwj 

dr 2 + g'(Wi) \ dr / dt/dr dr 


(II.3) 



g'(Wi) l 


k 2 M t 

\g(Wi)\* 


g( Wi) 


+ k 



g(Wj) - g(u>i) 

|g(«>y) - g(Wi) | 3 


g(“fr) ~U 

|gK) 1 3 J ' ’ 

i = 2,3, • ••,«, 


where 
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The conformal property of the mapping specifies that 


dt_ 

dr 


g' (Wt) 


dwi 

dr 


dzi 

~dt 


i = 2,3 , 


where \dwi/dr\ /\dzjdt\ is the ratio of the speeds at corresponding 
points of trajectories in the 10 -plane and 2 -plane. 

Of the four nonlinear terms in equations (II.3), the two that are 
carried over from equations (II. 2) and exhibit the singularities of 
those differential equations are governed by the map g and by 
dtj dr. The nonlinear term 

g"(wd / dw\ i 2 

g'(Wi) \ dr) 

is governed by the map g and is seen to vanish if and only if the 
transformation consists of a rotation, magnification and/or trans- 
lation. It is unlikely, therefore, that (II.3) can be linearized as well 
as regularized. The presence in equations (II.3) of this new nonlinear 
term does not create additional singularities, since g' ^ 0, and hence 
does not affect essentially the existence problem. 

In particular, under the identity transformation, equations (II.3) 
become 

d 2 Zi d 2 t/dr 2 dzi 

dr 2 dt/dr dr 


(H.4) 



k 2 M, 


Zi -T k 


2 t m- 

j = 2:j*i L 




i = 2,3, 

the equations of motion in the 2-plane after a transformation in the 
independent variable. 

With the selection of regularizing mappings, solutions of equations 
(II. 3) may be written in the neighborhood of images of collisions. 
These solutions will immediately transform back to the physical 
plane via the selected mappings, without the necessity of solving 
equations (II. 2). 
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Restriction of the dependent variable transformations to con- 
formal maps appears to offer advantages. The theory of conformal 
mapping, with its established existence theorems, is available, and 
certain invariant properties of conformal mapping might be helpful. 

For example, equations, (II. 2) may be written in terms of the 
gradient V , 


- v„ (V. + *,), 


i — 2, 3, 


where 


(H.5) 

and 

4>i ( 2 2> 2 3> 


Vi(zd = 


k 2 Mi 

1**1 


■A-.*-)-** i 

j=2j*i V \ Z J z i\ 



Then, properties of conformal mapping give at once the equiv- 
alent of (1 1.3), 


d 2 Wj g"(Wj) / dw t V _ d 2 t/dr 2 dwj 

dj 2 g' (l^i) \ dr ) dt/dr dr 


(dt/dr) 2 

\g'(Wi)\ 2 


V^(V # -+4 >i)\ 


i = 2,3, 


where ( V, + 4>,)' is the transformed ( V, + 4>;). 

It might be hoped that this behavior of the potential V* + under 
conformal mapping is not affected by the interesting fact that this 
potential is harmonic (statisfies Laplace’s equation) in 3-dimensional 
Euclidean space but is not harmonic in Euclidian 2-space. 

III. Regularization of the relative 3-body equations in the case of 
binary collisions, using Sundman’s transformation. In this section we 
will show that the relative 3-body equations can be regularized, in 
the case of a binary collision, by a transformation due to Sundman 
of the independent variable only, and the coordinates of the bodies 
can be expanded in convergent series for all real values of time. 

a. Regularizing Function. The equations under discussion are ob- 
tained from (II. 4) as 
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d 2 z 2 d 2 t/dT 2 dz 2 

dr 2 dt/dr dr 

(III.l) 

fdt\ 2 i k 2 M 2 _ , t2 „ T 23-22 

-UM | 2! i* Z!+ * m i| 2 ,- 2! i* 


d 2 z 3 d 2 t/dr 2 dz 3 

dr 2 dt/ dr dr 

(III.2) 

fdt\ 2 f k 2 M 3 _ , l2 _ T 22-23 

■wt w 

Denote 

| 2 2 | = r 2 , \z 3 \ = r 3 , |z 3 - z 2 | = r 0 . 




Without loss of generality we will consider the binary collision to 
be that of m 2 with m 3 . The Sundman transformation which ac- 
complishes this regularization is given by the equation 


(III. 3) dt= (1 -e-^Hl -e"^)(l -e^ /l )d Tf 


where l is the positive lower bound of r 2 and r 3 , shown by Sundman in 
[ll]to exist. Denote 

4> k =l-e~^ 1 , k = 0,2, 3. 

Clearly 


0 < 1 - - ^ 0 * < 1 , 

e 


k = 2,3 


and 0 ^ 4> 0 < 1, the equality holding for r 0 = 0 . Then 0 ^ dt/dr < 1 
and (dt/dr) ro _ 0 = 0 . 



Figure 2 . Coordinates for Sundman Transformation 
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Using l’Hopital’s rule we see that 


dt/dr 02 03 -rail r 0 2 03 1 w 

hm = hm — =— e 01 = lim — — < j = m, 

r 0 — *0 r 0 r 0 — *0 * r 0 ^0 * * 


/7i a finite quantity, 
i.e., 


4^- = ow, 

dr 


or cft/dr vanishes to at least the same order as r 0 , as 7* 0 — >0. Also, in 
the same notation it follows that 


and 


0o = O(r 0 ) 

0*= 0(1), k — 2,3, 


i.e., 


10*1 ^ constant, t* o —>0; k = 2,3. 

Hence the binary collision will not cause a singularity of the differen- 
tial equations (III.l) and (III.2) provided the following quantities 
remain finite as t\>— >0. 


(III.4) 


dz 2 


dz x 


d*t/dr a 

dz 2 

dr 

y 

dr 

y 

dt/dr 

d7 


d 2 t/dr 2 dzj 

dt/dr dr 


To show that the limiting values of these quantities are finite, we 
will make use of the energy integral obtained from the relative 3- 
body equations of motion in their original form (II. 1), before the 
transformation of the independent variable. These equations are 


(III.5) 

d 2 p-z __ 

k 2 M \ p ' 2 + k 2 m 2 

Pi 3 
3 

el 

3 » 


dt 

P 2 

i 

~ P 23 

PaJ 

(III.6) 

d z p -, _ 
1.2 

yz MiPi 

+ k 2 m\ 

P23 

3 

-k] 


dt 

pi 

L 

P23 

P2J 

where 







M 2 = m x + 777 2 ) 


M 3 = 777 j + W 3 . 
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Figure 3. Vectors for Energy Integral 

Now multiply (III.5) by m^p 2 and (III.6) by m 3 ~p 3 and add the re- 
sulting equations to obtain 


X * Pi = k 2 ) “ 
i = 2 I 


P23 * P23 / P3 * P2 . P2 ' 

m 2 m 3 — 3 m 2 m 3 [ — f- 


P23 


/ P3 * P2 P2 • P3\ 
V P3 P2 / 


(III.7) 


P2 
P3 ' 


- M 2 m 2 ?Kp- - M 3 m 3 p -Kp\ . 

P 2 P3 J 


The term in parentheses on the right side can be written 


P3 * P2 , P2 * P3 _ P3 f~ — \ i P2 /— s — \ 

3 1 3 “ T * 1P3 ” P23) “T T * lp2-h P 23) 

P3 P2 


P 3 


P2 


(III.8) 


P3’P3 1 P2‘P2 — / P3 P2 

3 I 3 P23 * l — 

P3 P 2 \P3 P2 


Another expression for the last term of (III. 8) is obtained by sub- 
tracting (III.5) from (III.6). 

P 23 = P3 - P 2 = k 2 M 2 ^f- - k 2 m 3 + k 2 m 3 ^Y - k 2 m 2 ^Y 

P23 


P 2 


P3 


P2 


-k 2 m 2 ?£-k 2 M i p 3 


P 23 


3 

P3 


= — & 2 (m 2 + m 3 ) — k 2 m x ^~ 

P 2 P23 P3 

= - - k 2 (m 2 + m 3 ) . 

\P 3 P2 / P23 


P 3 P2 


P3 


p2 L 


[" P 23 + k 2 (m 2 + ^ 3 ) 

P 23 J 


Therefore 
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and 

(III.9) P 23 • ( ■ *§- - *§•) = P 23 • P 23 + k\m 2 + m 3 ) • 

\P3 P2 / « /WiL P23 - 1 

Then (III. 7) becomes, after substituting (III. 8 ) and (III.9) and 
simplifying, 


m 2P2 * P2 + m 3P3 * P3 + 


TYl 2 W3 u- 
P23 ' P23 


= - k 2 M m 


P 2 * P2 

! 3 

P2 


+ wi 3 


p3 • P3 P23 ‘ P23 


3 

P3 


+ 


mi 


3 

P23 


where M = m l -\~ m 2 + ^ 3 . Integration gives the energy integral for 
the relative 3-body equations: 


-(m l m 2 p2 + ^i^ 3 p3 + m2 m 3pD 


(III. 10) 


= k 2 M( rr ^ 2 + + E( constant). 

V P 2 P3 P23 / 

From this we see immediately that 


(m^a^ 4- m 3 p3 +- m 2 m 3 p‘2 3 ) = 0 



i.e., the expression in parentheses goes to infinity to the same order 
as I/P 23 , as P 23 — ^ *0. Hence we have the important results 


(III.ll) 




which imply also 


P2 — 0 




P23 — 0 



(III. 12 ) 
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In complex notation, the equivalent expressions are 

l*l-o(^), N-o(^), fel-o( r 4) 

and 

Returning now to the Sundman transformation (III. 3) and the 
quantities (III.4), we obtain, since dt/dr = O(r 0 ), 


(III. 14) 

Finally, denote 


dz 2 


. dt 
Z2 Tr 

dr 


dz 3 


. dt 
Z *dr 

dr 



= O(r^) 

= 0(rD. 


(III. 15) 


* = -^ = (1 - e- ro/, )(l - e — r2/1 ) ( i _ c -»»/') 
dr 

= 00 02 03 * 


Then 


and 


d 2 t _d$ d$ dt 

dr 2 dr dt dr 


^ = y | 3 e r3/( + <t>o<t>sf 2 e nli + <t>z<t>3f 0 e r °' 1 


Since 


0o = O(r 0 ), and 0 * = 0(1) , A; = 2,3, 
expressions (III. 13) indicate 



but using (III. 14), 
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d$ dz 2 

dt dr 

and similarly, 

d 2 t/dr 2 dzs = 
dt/dr dr 

Thus the Sundman transformation (III. 3) or (III. 15) does regularize 
the relative 3-body equations (III.l), (III. 2) in the case of a binary 
collision, and hence these equations are regular for all r, |r| < °o, if 
we exclude the possibility of triple collision. Therefore the solutions 
2 2 , z a are analytic in r, and can be expressed as power series in (r — T 0 ) 
for arbitrary r 0 , in some interval of convergence. 2 It follows that 
r 0 , r 2 and r 3 are analytic in r, hence $ = dt/dT is analytic in r, 
| T | < oo. Further, since 0 ^ dt/dr < 1, the equality holding if and 
only if r () = 0, then t is a monotone increasing analytic function of r, 
\t\ < o o. Thus there is a one-to-one correspondence between t and 
r for t % t real, — oo < £ < oo, — oo < T < oo. 

b. Series Solution Convergent for all Finite Values of Time. If the 
radius of convergence of the power series in (r — r 0 ) lor z 2 and z-s is 
infinite, the problem is completed, i.e., these series are valid for all 
real t , because of the one-to-one correspondence between t and r for 
all f, 7 real. 

Consider therefore that the radius of convergence about r 0 (real) 
is finite. Since there are no singularities of the differential equations 
of motion (III.l) and (II 1.2) for all real r, the finite radius of con- 
vergence about any real r 0 must be due to the existence of singular- 
ities of the solution z 2 , z :i in the finite part of the r-plane but not in 
the real axis. Suppose that the singularity which is nearest to the 
real axis for all r 0 is at a positive distance h from this axis. (Sund- 
man proves the existence of such a constant h.) 

By the Poincare transformation 

2h . 1 + w 

T Ty — lOg , 

7T 1 — U) 

the interior of the strip of width 2h in the r-plane is transformed 

2 Sundman in [ 1 1 1 discusses how the same series define a continuation of the motion 
after a collision. 


d^t/dr 2 dz 2 

dt/dr dr 
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conformally onto the interior of the unit circle |u>| = 1 in the w- 
plane. This is easily seen through the successive transformations 

f = e (*/2*)('-'o) w= tz 1 

f + 1 ' 

In particular the real axis - ao< T - To < oois mapped in one-to-one 
correspondence onto the real segment — 1 <w< 1. 




The solutions z 2 ,z 3 are therefore analytic in w, |u;| < 1, and hence 
can be expressed in power series in w , convergent in |o>| <1. These 
series will therefore converge for all real w , \ w\ <1 and consequently 
for all real t, |t — t 0 | < oo. Finally, because of the one-to-one cor- 
respondence between r and t for t , t real, — oo< r < — oo< £ < oo, 
these series converge for all real time t , — oo < t < oo. 

References 

1. E. L. Ince, Ordinary differential equations , Dover, New York, 1956. 

2. E. A. Coddington and N. Levinson, Theory of ordinary differential equations , 
McGraw-Hill, New York, 1955. 

3. S. Lefschetz, Differential equations'. Geometric theory. Interscience, New York 
1957. 


118 


«« 


4. V. V. Nemytskii and V. V. Stepanov, Qualitative theory of differential equations, 
Princeton Univ. Press, Princeton, N. J., 1960. 

5. F. R. Moulton, Differential equations, Dover, New York, 1958. 

6. R. P. Agnew, Differential equations, McGraw-Hill, New York, 1942. 

7. K. Yosida, Lectures on differential and integral equations, Interscience, New York, 
1960. 

8. F. R. Moulton, Celestial mechanics, Macmillan, New York, 1914. 

9. E. T. Whittaker, A treatise on the analytical dynamics of particles and rigid bodies, 
Cambridge Univ. Press, New York, 1959. 

10. G. D. Birkhoff, Collected mathematical papers, Vol. 1, Amer. Math. Soc., Provi- 
dence, R.I. 1950. 

11. K. F. Sundman, A lemoire sur le probleme des trois corps, Acta Math. 36(1912), 
105. 

12. E. Leimanis, Qualitative methods in general dynamics and celestial mechanics, 
Appl. Mech. Rev. 12 No. 10(1959), 

Aerospace Research Center 
General Precision, Inc. 



John P. Vinti 


N67 1^- • 

The Spheroidal Method 
in Satellite Astronomy 


I. Introduction. If r is the position vector of an artificial satellite 
of an oblate planet, relative to the latter’s center of mass, the drag- 
free motion of the satellite is determined by the differential equation 

(1) r = - V V. 

Here the gravitational potential V of the planet is expressible as 
an expansion in spherical harmonics 

(2) V “-;[ 1 -S(7) 

where r— |r|, 6 is the declination, r e is the equatorial radius, P n 
is the nth Legendre polynomial, and n = GM , the product of the 
gravitational constant and the mass of the planet. Besides the drag, 
equations (1) and (2) neglect the lunar-solar perturbation and all 
nongravitational forces. The constants J n are pure numbers charac- 
terizing the planet’s potential, with J 2 = (1.08) 10~ 3 for the Earth 
and with all the other J n ’s of the order 10 “ 6 or smaller. 

II. Possible reference orbits. Most approaches to the problem of 
solving (1) and (2) for the orbit have begun with replacing V by 
V 0 = — n/r and finding the perturbations of the resulting elliptic 
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orbit, produced by the higher harmonics. Sterne in [6] and [ 7 ] and 
Garfinkel in [2] and [ 3 ] both began with potentials of the form 
V=f(r,e ), taking into account part of the effect of the second 
harmonic. Further progress then depends on finding how the re- 
sulting intermediate or reference orbit changes with time. 

To take advantage of our knowledge of the actual shape of the 
earth, or of any oblate planet more closely resembling an oblate 
spheroid than a sphere, the author 1 in [8] and [ 9 ] decided to try 
oblate spheroidal coordinates. If X , Y, and Z are the usual rectangu- 
lar coordinates, these spheroidal coordinates p, 17, and <t> are defined 
by the equations 

(3.1) X + iY = rcos0expi0 = [(p 2 + c 2 )(l — t? 2 )] 1 2 exp i<t>, 

(3 2) Z = r sin0 = py. 

Here c is an adjustable distance, small compared to r e . For large 
r, p — > r and tj — > sin 0 . The surfaces p = constant are oblate spheroids, 
approaching sphericity as p increases, and the surfaces 7? = constant 
are hyperboloids, asymptotic to the cones 8 = constant. 

With the hope of obtaining a more accurate reference orbit as 
a starting point for the calculation of satellite orbits, the author 
wrote out the Hamilton-Jacobi equation in these coordinates, 
finding that it would be separable if V has the general form 

(4) V = (p 2 + C 2 rj 2 ) l [f(p) +g(v)]- 

On imposing the requirement that V' shall be a solution of Laplace s 
equation V 2 V' = 0 and that this solution shall be nonsingular on 
the Z-axis, one finds that the functions /(p) and g(rj) can only be 

(5) /(p) = b x p y giv) = & 2 */- 

Placing the origin at the center of mass then makes b 2 = 0 and 
requiring V 7 to have the form — p/r at large r makes b x — — p. 
Then 

(6) V' = - 2 |*V 2 = - mR e( P + ic v ) _1 . 

p + C 1} 

1 The author’s work in this area has received support from the National 

Aeronautics and Space Administration. 
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( 7 ) V' = - - pP 2 ( sin$) + ^P 4 (sin0) - ^P 6 (sin0) -\ J 

then shows that V 7 agrees with V through the second harmonic if 

(8) c 2 = r 2 J 2 . 

With such a choice for c, perfect agreement between V and Y 
requires that J 4 = — J|, J 6 — J 2 , • • •. Since observations show that 
J 4 ^ - 1.5 Jf, it follows that Y also represents about two-thirds 
of the fourth harmonic. It follows that Y accounts for about 99.5 % 
of the departure of V from the simple value - n/r that would hold 
for a spherically symmetric planet. In other words the geoid con- 
structed from Y never departs from the actual sea-level surface by 
more than a few hundred feet. Furthermore, Weinacht in [ 14 ] 
proved that separable motion of a particle in Euclidean space is 
either a Staeckel system or reducible to a Staeckel system by a point 
transformation. Of the eleven systems of coordinates in which 
Staeckel systems may be expressed, the oblate spheroidal has the 
most appropriate symmetry. Furthermore, equation (6) is the most 
flexible solution of Laplace’s equation in this system that leads to 
separability. It therefore appears likely that the orbit of a particle 
moving in the potential field (6), with c 2 =r 2 J 2 , is the best pos- 
sible reference orbit that can be chosen, from the point of view 
of accuracy of fit to the actual orbit without perturbation theory. 

III. The quadratures. If a x is the energy, <* 3 the axial component 
of angular momentum, and a 2 a separation constant that would re- 
duce to the total angular momentum in the Keplerian case c = 0, 
then with the potential (6) the Hamilton-Jacobi equation separates, 
with a solution 

(9) w = Wi(p, ot u a 2 , a 3 ) + w 2 (r), a u a 2 , a 3 ) + a 3 0 . 

If 0 ij ( 3 2 , and ( 3 3 are constants, such that in the Keplerian case 
— fix would be the time of passage through perigee, ( 3 2 the argument 
of perigee, and 0 3 the right ascension of the node, the coordinates 
p, 7/, and 0 are given by the solution of 
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( 10 . 1 ) 

( 10 . 2 ) 

( 10 . 3 ) 


^^=< + /3, = ± f p 2 F~ 1/2 dp ± c 2 f I? 2 G 1/2 dri, 
Jp\ Jo 

— = ft = =F «, f F 1/2 dp ± a 2 f G' 1/2 dti, 

da 2 

^ = ft = 0=Fa 3 f (1 - >» 2 ) _1 G~' /2 dv 
aa 3 Jo 

±c 2 «3 f(p 2 + c*)- 1 F- I/2 4p. 


Here 

G(iy) — — af + (1 — t/ 2 ) (a| + 2 a 1 C 2 rj 2 ) 

(111) / »\ / «N 

= (af - «i) ^1 “ Vj ( - T?o ^ V ^ ^ 1 ) 


a quadratic in rj 2 , and 

F(p) = C 2 a 3 + ( p 2 + C 2 )( — a 2 + 2pp + 2aip 2 ) 

= ( — 2a0 ( p — Pi) ( p 2 — p) ( p 2 + Ap + B) ( Pi ^ p ^ P 2 ) 


a quartic in p. 

IV. Factoring the quartics. The material of this and the next two 
sections is based on [ll], [12], and [13]. 

Finding the coordinates as functions of the time depends on in- 
verting equations (10.1) and (10.2) to obtain p and y\ in terms of 
t and then inserting the results into (10.3) to obtain <f>. To do this 
we must first evaluate the above integrals and this evaluation re- 
quires factoring the quartics F(p) and (j(t?). 

We may define constant orbital elements a 0 ^ — m/ 2a 1 , e 0 
3 (1 + 2a 1 a|/p 2 ) 1/2 , and i 0 = cos _1 (« 3 /« 2 ), ft, ft, ft that can be 
obtained directly from initial conditions. In this way we can 
factor G(rj) exactly and F(p) through order J\ without difficulty. 
A somewhat better set of elements is a, e, 7, ft, ft, ft, introduced 
by Izsak in [4]. Here a = (p 1 + p 2 )/ 2, e = (p 2 — pi)/(p 2 + Pi), and 
I — sin - So* The quantities ai, a 2 , a 3 , A, B f and t? 2 can all be ex- 
pressed in terms of these elements, so that the latter lead to an 
exact factorization of F(p). These elements can be obtained from 
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initial conditions by numerical solution of F(p) = 0, but they can 
be determined without such a procedure by iterated least-square 
fitting to an observed orbit. 

V. Evaluating the integrals. The integrals in (10.3) can be expressed 
as incomplete elliptic integrals of the third kind and those in (10.1) 
and (10.2) as incomplete elliptic integrals of the first and second 
kinds. It is simpler, however, to avoid such a formulation. Sup- 
pose we introduce the uniformizing variables E, v, \p , and x, de- 
fined by 

(12.1) p = a(l — ecosE) = a( 1 — e 2 )(l + ecosv) ~ 1 , 

(12.2) 7/ = 77 0 sin^, 

(12.3) exp i\ = (1 - Tjosin 2 ^) _1/2 (cos^ + i( 1 - i^) 1/2 sin ^). 

Here E and v are analogous, respectively, to the eccentric and 
true anomalies in elliptic motion, \p to the argument of latitude, 
and x to the projection of the orbital arc on the equator. By using 
an expansion in Legendre polynomials with argument — A B~ m /2, 
viz., 

(13) (1 + A/p + B/p 2 ) ~ 1/2 = f) (B m p-yp n (-\ AB- m ) , 

n= 0 \ 2 / 

we can express the p-integrals R u R 2 , Rs, occurring respectively 
in (10.1), (10.2), and (10.3), in the forms 

(14.1) (— 2a i ) m R l = — ~ AE + a(E — esin E) + A x v + 23 A^sinjw, 

1 j - 1 

(14.2) (— 2ai) 1/2 iZ 2 = A 2 v + ^2 A 2 j sinjv, 

;= i 

(14.3) ( — 2«i ) 1/2 R 3 — A‘ 3 v + ^ A sj sin jv. 

Here the coefficients A u A 2 , and A 3 are infinite series, leading to 
an exact evaluation of secular effects for the reference orbit, and 
the sine terms are carried far enough to give periodic effects 
through order </ 2 2 . 
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We can express the corresponding ^-integrals as 

Ni = Cj + + Z Cl; sin 2 j i, 

i= i 
2 

N 2 = C 2 \fr + ^ C 2; sin 2 j\p y 
y - 1 

(15.3) N 3 = C 3 x + Cd + C 32 sin2i/'. 

Here and C 2 are expressible in terms of the complete elliptic 
integrals K(?j 0 / 7 j 2 ) and £(r? o/vz) and N 3 in terms of an infinite 
series. The results for the ^-integrals are thus also accurate enough 
to give secular effects exactly for the reference orbit and periodic 
effects through order J|. 

VI. Solution of the kinetic equations (10). One inserts (14) and 

(15) into equations (10.1) and (10.2), placing 

(16) E = M s + E p , v = M s + v py \p = \p s + \p p . 

The secular terms M s and are then found by dropping E p , v p , 
}p p and the sine terms in (10.1) and (10.2) and solving a pair of 
linear algebraic equations. The secular mean anomaly M s appears 
as the product of 2ttv x and a linear function of t + (3 X ; the secular 
term \f/ s is the product of 27r^ 2 and a linear function of t + £ 1 . Here 
v x and i >2 are, respectively, the mean p-frequency dajdji and the 
mean ?/- frequency da x /dj 2 , j 1 and j 2 being the corresponding action 
variables (see [10]). 

One then expresses the periodic terms as 

(17) E P =E 0 +Ei+E i9 v p =v i) +v l + v 2 , t P = 'P 0+^1 + ^ 2 , 

where E n , e.g., denotes a periodic part of order J 2 . One then places 
E p =E 0l v p = v 0 , \pj = ^0 into (10), rejecting all periodic terms of 
order J 2 or higher, and solves (10.1), (10.2), and (12.1) for E 0 > v 0 , 
and \fr 0 . Here M s + E 0 appears as the solution of the Kepler equation 

(18.1) M s + E 0 - e'sm(M s + E 0 ) = M sy 

(18.2) e ' s ae/a 0 < e. 

One continues by adding in the terms E u v u and yp x into equations 
(10), rejecting only those periodic terms of order J 2 or higher. Then 


(15.1) 

(15.2) 
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M s + E 0 + E x satisfies a similar Kepler equation, the right side 
getting an additional term M u periodic of order J 2 , depending on 
v 0 and \p s + \f/ 0 . This second Kepler equation does not require a 
full-fledged solution, but may be solved by a differential method. 
Knowing E u one may then use (12.1) to find v x and (10.2) to find yp x . 

One continues in a similar fashion to find E 2 , v 2 , and \p 2 . For the 
reference orbit the secular parts of E , v, and \fr are then known 
exactly and the short-periodic parts through order J 2 . There are 
no long-periodic terms in this solution for the reference orbit. 

Equations (12.1) and (12.2) then give the spheroidal coordinates 
p and 77. The right ascension 4> follows from (10.3), (14.3), and (15.3), 
after insertion of \p into (12.3) to find x* This completes the solu- 
tion for the reference orbit. 

VII. A sketch of the necessary perturbation theory. If we subtract 
(7) from (2), we find that the part of the gravitational potential 
not accounted for in the reference orbit is given by 

(19) V- V' = ^J 3 P 3 (smO) Jl) P<(sine) + .... 

r r 

As an example, we consider here only the residual fourth harmonic, 
so that the perturbing term in the Hamiltonian becomes 

(20) H’ = ^-( J 4 + Ji) P 4 (sin0). 

r 

If we use the formulas of elliptic motion for r and 0, viz., 

(21) r = a(l — ecosE) = a(1 ~ g) , 

1 + e cos v 

(22) sin0 = sin I s\n(v + (S 2 ) 

the perturbation H' will be correct through order J 2 . This order 
of accuracy will result in secular and short-periodic changes correct 
through order J\ and long-periodic terms correct through order 
J 2 - (It is well to emphasize at this point that this order of accuracy 
is for effects produced by less than 0.5% of the departure of the 
planet from sphericity; for the 99.5% of this departure already 
accounted for by the potential (7) the secular effects are exact 
and long-periodic effects do not exist.) 
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In doing the perturbation theory, the first canonical variables 
that come to mind are the Jacobi “constants” a u « 2 , «3, 0i> 02, 
and 0 3 . Their shortcomings are well known, however, since they 
lead to Poisson terms in and a 2 . The next set that comes to 
mind is the one generated from the a s and 0’s by the generating 
function 

(23) S' = — a\t + m( — 2a0 _ 1 / 2 0i + « 2 0i + of 3 03* 

If we define /i 0 by 

(24) m = nfal a 0 s - m/2«i 
this leads to the set 

L = (m^o) 1/2 » i = Ho(* + 0i) 

(25) a 2 0 2 

Of3 03, 

canonical with respect to the Hamiltonian 

(26) H= -h 2 /2L 2 + H'. 

One may attempt to apply the von Zeipel method in the way 
successfully used by Brouwer in [l], first eliminating short-periodic 
terms and then proceeding to eliminate long-periodic terms. One 
then finds, however, that the corresponding generating function 
Si, which ought to be of the first order in the parameter 


(27) 


a — c/4 e/ 2 


must satisfy 


(28) 


dS? 

00i 


Zeroth order in a. 


One may alternatively eliminate short- periodic and long- periodic 
terms simultaneously, but one then obtains a Poisson term of the 
form 1 ;' sin 202 in a 2 — Since v' has a secular part, such a result 
would appear absurd, since the “constant” a 2 , which ought to have 
only a small periodic variation, would then become infinite. These 
difficulties are examples of the failure of the von Zeipel method 
whenever the following conditions both hold: 
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(1) the perturbation has a long-periodic part of the first order 
in the perturbation parameter a, and (2) the canonical variables 
are such that the unperturbed Hamiltonian depends only on L. 

The following set, however, is successful. If j u j 2 , and j 3 are 
the action variables and if w u w 2 , and w 3 are the corresponding 
angle variables, we define L, G, H, l, g, and h by 

2wL = ji + j 2 + jz sgn a 3 , 

(29) 2nG = j 2 + y 3 sgn« 3 , 

2wH = 7*3, 

l — 2 ww u 

(30) g = 2t(w 2 - w x ), 

h = 2 tt(w ; 3 - w 2 sgn a 3 ) , 

where sgn « 3 = ± 1 respectively for a direct orbit or a retrograde 
orbit. To verify that they are canonical, note that 

(31) Ldl + Gdg + Hdh = jidWi + j 2 dw 2 + jsdw 3 . 

They were introduced by Izsak in [5] in his application of the 
author’s theory to the problem of the critical inclination. 

If 

(32) it* — dji/ dak (i,k = 1,2,3) 

the /?’s are then given by 

2ir(t fti) = 7*21 (/ g) + j ill, 

(33) 2ttI3 2 = j22(l + S) jnU 

27T03 = 2i rh + 7* 13 Z (7*23 + 27rsgna 3 ) (l g). 

The constant orbital elements in the perturbed problem then be- 
come the constant parts a ", e", and rjo of a, e, and vo, along with 
the initial values l£ y go, and fifi of the secular parts of l, g, and h . 
The corresponding Hamiltonian F is given by 

(34) F=F 0 (L,G,H) + F 1 
where 

(35) 


— — «l, 


Fi = - H> ' 
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and 



• OF 

1 = 

dF 


L ~~dT’ 

dL’ 


■ OF 


dF 

(36) 


g = 

~dG’ 


dF 


dF 


H = Jh’ 

h = 

dH' 


One cannot express the unperturbed Hamiltonian Fo = ~ « i exactly 
as a function of L, G, and H, but it is not necessary to do so. One 
needs only the derivatives 


dFo = j, dFo d£i 
dL i dji dL 


2irv\, 


(37) 


dFo = y dFo dU 
dG j dji dG 


2ir(vi — ^ 2 ), 


dFo 

dH 



dF ° ^ 9 ( ^ 

— ~ 7777 = 27r(i/ 2 Sgno(3 “ " 3 )- 

dji dH 


On applying the von Zeipel method, one first carries through 
the simple but tedious elimination of the short- periodic terms. Pro- 
ceeding to the long-periodic terms, one finds that the appropriate 
generating function S*(L' ,G" ,H,g') must satisfy 


(38) 


dF 0 ds r 

dG" dg' 


(F\) 


long- periodic* 


leading to 

(39) 2* (»f - v'{) ^ = of(L', G", H) cos 2 g' , 

dg 

where / is a certain function of 7/, G", and H . Since v" — v'{ is 
proportional to 1 — 5 H 2 /G" 2 ~ 1 — 5cos 2 7, this leads to the familiar 
resonance denominator, whenever a = J 4 + Jf ^ 0. Since o / (v" — ) 

— 0( c/ 2 ) , the long-periodic terms are accurate through order J 2 . 

After finding the above canonical variables as functions of time, 
one easily converts their changes into changes of a u « 2 , and « 3 or 
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of a, e, and I. On inserting the functions a(t), e(t ), I(t ), l(t ), g(t ), 
and h(t) into equations (10), one then can find, by differential 
methods, the changes in E, u, and x, and thus in the coordinates, 
that are produced by the perturbation. It is not necessary to do 
a complete re-inversion of (10.1) and (10.2). 
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Precession and Nutation 


I. Couple due to distant mass acting on nonspherical object. Let 

the axes be principal axes of inertia at the mass-center 0 (see 
Figure 1). Let the distant mass be a particle of mass M' at P(x,y, z)> 
and let r 2 = x 2 J - y 2 + z 2 . Let a typical particle of the object be 
of mass m at Q(£, y,t), and let 

P 2 =(x-0 2 +(y-v) 2 +(z- f) 2 . 



Figure 1 . Couple Due to Distant Mass 
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Then the force components acting on m are 
X = GM'm{x — £) /p 3 , 

Y = GM'm(y — ??)/p 3 , 

Z = GM'm(z — f) /p 3 . 

Thus 

L = moment of forces about Ox 

=Z(vz- m 


^ GM'm { , 

= X — 3 — M 2 — f) — f(y — »?)] 




But, neglecting squares and products of £, 77 , f, we have 
P 2 = (* - £) 2 + (y- 1?) 2 + (2- f) 2 = r 2 - 2 (x£ + yj;+zf), 
3(xf + yjj + 2f) 


1 _i_" 

”3" ~3~ 

p p L 


1 + 


r 


Therefore, neglecting terms of the third order in (£,vA), 

L- ^ E-Ml* - ») [1 + 3( ^ + « + ^> ] 

-— r .' yz x.™^- rt, 

since = 0 and £mt;f = 0 , etc. But if the 

principal moments of inertia at 0 are A, B, C> we have 

A =X>Gf 2 + f 2 ), B = Z m (f 2 +£ 2 ), C = £m(£ 2 + A 

^m(v 2 — f 2 ) = C — B, 


(1) a L = 


2>GM'(B - C)yz 


M= - 


3 GM'(C - A) zx 


N= - 


3 GM'(A - B)xy 
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II. Case* of an axially symmetrical body (e.g. the Earth). In this 
case A = B < C, and the couple has components 

3GAP(C - A)yz 3 GM'{C - A)zx n 

(2) p > ^5 » u * 


If x = rcos<5, y = 0, z = rsin5, so that 5 is the declination of the 
distant object, the couple components are 


(3) 


0 , 


3 GM'(C- A) 


sin 5 cos 5, 0, 


i.e., the couple is in the plane of M' and the Earth’s axis, and is 
in such a sense that (if the Earth were not rotating) it would tend 
to move the Earth’s equatorial plane into coincidence with M' . 

III. Couple components derived from potential energy. Let U be 

the gravitational potential at P due to the nonspherical object, so 
that the mutual potential energy of this and the mass M' at P is 
V — — M' U . To the order of approximation already assumed in 
§1, U is given by MacCullagh’s formula, proved in many text books, 

TT GM j G(A + B+C-3I) 

~ f 2? ’ 

where / is the moment of inertia of the nonspherical object about 
OP and M is its mass. Thus 


(4) 


GMM' GM'(A + B+C-3I) 

' r 2? 


From this the general formulas (1) for L, M, N may easily be 
deduced by considering changes in V produced by small rotations 
about the axes of coordinates. 

Restricting ourselves to the case of axial symmetry (A = £), 
we have 


/ = A cos 2 5 + C sin 2 5 , 
where 6 is the declination of P. Thus 

A + B + C - 31 = 2A + C - 3(A cos 2 5 + Csin 2 6) 
= (C~ A)(l - 3sin 2 5). 
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(5) 


V= - 


GMM' GM'{C- A){ 1 - 3 sin 2 5) 
r 2r 3 


where it may be noted that 

^ (3sin 2 5 — 1) = P 2 ( sin 5), 

P 2 being the Legendre polynomial of the second degree. Thus the 
couple due to M f tending to decrease 5 has a magnitude 

dV 3 GM'(C-A) . 

— — = o sin 5 cos 5 

65 r 3 


which agrees with (3). 

This method is useful if it is required to ascertain the possible 
effect of the fourth harmonic. Taking 


U = 


GM 

a 



sin 2 5 


) + l D ? P ‘ (sinS) _ 


— Uq U 2 U 4 , 


we have 


P 4 (sin<5) = ^ (35sin 4 5 — 30sin 2 5 + 3) 
8 

and 


dP_< 

65 


1 

2 


(35sin 2 6 — 15) sin 5 cos 5, 


which is numerically less than or equal to 10 sin 5 cos 5. Thus 


dUjdb 

< 3B DX1 ° 

K- 8D 


dU 2 /d5 

= JX 2 1 

w 1J 1 


With J = 1.6 X 10 ~ 3 , D = 8 X 10 “ 6 , so that D/J is about 1/200, 
and with a/r = 1/60 in the case when M r is the Moon, this gives 
about 1/600,000. Since we shall not determine precessions to more 
than 4 significant figures, the effect of the fourth harmonic in the 
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Earth’s potential may be neglected. 

IV. Average couple when M f moves in a circle around the Earth. 
Suppose M' moves, for example, in the plane of the ecliptic (see 
Figure 2). Let P and K denote the poles of the equator and the 


z 



Figure 2. Mass in the Ecliptic 
ecliptic respectively. Take the x-axis along Oy. If 

3 GM'(C- A) 

K = ^ > 

2 r 

which in turn is equal to the maximum numerical value of the couple, 
occurring at <5 = d= 45°, the couple has, by (2), the components 
2 nyz/r 2 , — 2nzx/r\ 0. But x = rcosL, y = r sin L cose, z — rsinL sine, 
where e is the obliquity of the ecliptic and L = angle 7 OM' increases 
uniformly with the time. Thus 

yz — r 2 sin e cos e sin 2 L and zx = r 2 sin e sin L cos L 

have average values (1/2) r 2 sine cos^e and 0 respectively. Thus the 
average couple has components (^,0,0), where 

. SGM'(C-A) . 

N = k sin e COS ( = o sin e COS e . 

2r 3 

If the Earth were not rotating, this couple would tend to move 
the pole P of the equator towards the pole K of the ecliptic. 

We shall now consider the effect of such couples on the Earth, 
supposing it rigid. (The Earth is deformable, but not enough to 
affect the precession seriously, as has been found by several authors, 
and as is obvious from consideration of angular momentum.) The 
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i 


P 



Figure 3. Rotation of the Equator 

following is an approximate treatment, which ignores for the moment 
the distinction between the Earth’s axis of figure and axis of rotation. 

V. Effect of steady couple on the rotating Earth. Suppose a steady 
couple N ' about Oy acts on the Earth (axial moment of inertia C, 
angular velocity oj). In a small time t , the angular momentum Co; 
about OP receives a vector increment N't about Oy. Thus P moves 
towards the instantaneous position of y at rate n ~ N'/Cw , and KP 
rotates' about K at a rate & = ncosect. 


(6) 

(7) 


, . 3 GM' C-A 

n = motion of pole = — — sin e cos e , 

2r 3 aj C 


k = motion of equinox = 


3 GM' C-A 
2r 3 a) C 


COS€. 


Here ( C—A)/C , the Earth’s “mechanical ellipticity”, is found 
(actually from the precession) to be about 1/(305.5). 

Neglecting the eccentricity of the Earth’s orbit and the eccentri- 
city, inclination, and solar perturbations of the Moon’s orbit, the 
above formulas give the solar and lunar precessions, on substituting 
solar and lunar values of M f and r. (Variability of the couples 
produces nutation, to be considered later.) 

VI. Rough numerical values. For the Sun, GM' =n /2 r 3 , thus k 
from the Sun per year (2ir/n') is 


37T 


n' C- 


A 


1944000" cos 23° 27' 


* C 


cost = 


366.24 X 305.5 


= 15".9, 
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which is correct to one decimal. The relative importance of the 
Sun and the Moon (neglecting eccentricities, etc.) in producing 
precession (also tides) may be found from the ratio of their values 
of M'/r 3 . This is found, directly, or from (M/(E + M))(n/n f ) 2 
(see §XII), to be about 2.17. Thus k from the Moon per year is 
about 2.17 X 15".9 = 34".5, and the total luni-solar precession is 
about 

k = 15".9 + 34".5 = 50".4/year = 1° .4/century. 

VII. Luni-solar precession. If the ecliptic were fixed, P would go 
around K in a circle, in the retrograde direction, with period 360/1.4 
centuries, or about 25,700 years, with constant inclination e. The 
longitude of a fixed star would increase by about 50"/year, its 
latitude remaining constant. That is, AX = kt , A(3 — 0 y or X — k , 

= 0. With t = 23°27', sine == 2/5, so that motion of pole = n 
— k sine == 20"/ year. The effect of luni-solar precession (only) 
on right ascension and declination is shown in books on spherical 
astronomy (e.g. [13]) to be given by 

a = n cot e + n sin a tan 5, 

8 — n cos«. 

VIII. Effect of motion of the ecliptic (planetary precession). The 

plane of the ecliptic slowly rotates on account of planetary pertur- 
bations of the Earth's orbit. Let N, N' be the ascending and 
descending nodes of the ecliptic at time t + dt(dt > 0 ) on the eclip- 
tic at time t\ let y N = 180° — v and N'y = v. At 1950.0, v = 5°36 r , 
and j = speed of rotation of ecliptic = 0".471/year, which is less 
than 1/40 of the polar motion n. 

Resolve the rotation of the plane of the ecliptic into 

(i) j cos v about the line of equinoxes, decreasing e, so that t 
= — jcosr. At 1950.0, e= — 0". 469/ year (see Figure 5). 

(ii) /sin*/ about the line of solstices, giving yy'siru = j siru>\ 
thus 7 would move forwards on a fixed equator at a rate j sin v / sin e 
= l (see Figure 4 ). Thus / (which is slowly decreasing) is the plan- 
etary precession (in R.A.), decreasing all right ascensions. At 1950.0, 
j sin */ = 0".046, / = / sin p/sine = 0'M 15 (both yearly). 

IX. Full formulas for «, 5. Planetary precession contributes 
a = — /, 5 = 0 (since declinations are not affected). Thus 
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Figure 5. Motions of the Poles of the Equator and the Ecliptic 

a = m + n sin a tan 8, 

8 = ' n cos a, 

where m = “precession in R.A.” = ncote - l = kcose — l. General 
precession (in longitude) is accordingly 

p = n COSeC e — / COSe = k — / cose. 

The formulas are often used as Aa = (fh + n sin a tan 6) A/, A 5 
= /i cose* A/, where m , n refer to mid-epoch and a, 8 either refer 
to mid-epoch or are means of initial and final values of a, 5. But 
rigorous formulas must be used for long intervals or large 8. 

X. Numerical values for 1950.0. 

e = 23°27' n = 20 // .04 k = n cosec e = 50 /7 .37 

m = xt cot e - l = 46". 10 p = k- / cose = 50" .27. 

The above are Newcomb’s values, rounded to 4 figures. They take 
the mean of the stars to be at rest. But two factors affect the 
fourth figure, in the ways discussed in the next section. 

XI. Effect of galactic rotation and relativity, (i) It is now known 
that the mean of the stars is not at rest, because the galaxy is 
rotating. The effect on proper motions is of the order of 1" per 
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century. See, .for example, [9], [10], [3], [14, p. 382], [8] and 
[7, p. 16]. 

(ii) The dynamical interpretation of the precession also needs 
modification. The Newtonian dynamics being given really applies 
in a frame rotating at 1".94 per century directly around the pole 
of the ecliptic. See, for example, [3] and [12]. 

The effect of (i) and (ii) is that corrections of about + 0".01 
and + 0".02, or a total of + 0".03, are needed by annual preces- 
sions in longitude (such as k and p) if they are to be used to 
determine (C— A)/C. Thus the more refined treatment of the 
luni-solar precession now to be given is adapted to the value 
k = 50".40 (at 1950.0) instead of Newcomb’s 50".37. 

XII. Luni-solar precession (more exact treatment). If n , a refer 
to the Moon’s orbit and n', a' refer to the Sun’s orbit, let 

^ 3 GS C-A Jjr 3 GM C-A 

Kl ~2 a' 3 <o C ’ 2 ~ 2 a 3 * C ' 

Here S and M are the masses of the Sun and Moon, and K u K 2 
are constants such that (by the theory of §V) the Sun and the 
Moon would cause polar motions 

n x = Ki cost sin t, n 2 — K 2 cos t sin € 
and precessions in longitude 

k x = K icost, k 2 = K 2 cos t 

if the eccentricity e f of the Earth’s orbit and the eccentricity e 
and inclination i of the Moon’s orbit were zero and the Moon’s 
motion were not perturbed by the Sun. Let E — the mass of the 
Earth. Then 

G(E + M) = n 2 a\ 

3 M n 2 C-A 
K<1 ~ 2 E+M* C * 

E/S = 3 X 10‘ 6 and may be neglected here. Put n = M/E . Then 

3 n /2 C - A 3 (i n 2 C — A 

= 2 C ’ 2_ 2 1 + m^ ~C~~‘ 


G{S + E) = n /2 a'\ 

Thus 

3 S n /2 C — A 
Kl ~2S + E w C ’ 
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3 n' 2 

2 CO 


n 


3 1296000"/year 
2 


5308"/ year. 


2 co/n' 2 366.24 

Using (C — A)/C = 1/305.5 we find that the value of K x is 
K x = 17".37 per year. 

Also 


K 2 _ M/a 3 n / n \ 2 

Ki = s7^ = TT7 U 7 /' 

Using fi = 1/81.3 and n'/n = 0.07480 we find that 

K 2 /K x = 2.172. 

As 2.172 X 17.37 — 37.73, we find that the value of K 2 is 
K 2 = 37". 73 per year. 

XIII. Allowance for eccentricities, etc. To allow for eccentricity 
of orbits, we require the average of 1/r 3 rather than 1/a 3 . But it 
is easily shown that in a Keplerian orbit the average of r -3 is 
a -3 ( 1 — e 2 ) -3/2 , so that the factor (1 — e 2 ) -3/2 , or approximately 
l + (3e 2 /2), allows for eccentricity. 

In addition, for the Moon, if a is defined by G(E + M) = n 2 a 3 , 
a is not the mean distance. The variational orbit gives 

a/r — 1 + \ m 2 + m 2 cos2£ + 0(/rc 3 ), 

6 


where m = n'/n temporarily, and | varies linearly with the time. 
Thus 


(a/r) 3 = l + ^m 2 + 3m 2 cos2£ + 0(m 3 ), 
z 

and the average of (a/r) 3 is 1+ |m 2 approximately. 

Thus, omitting squares and products of e' 2 , e 2 , m 2 , define 


(8) 

P i = 


K u 

(9) 

P 2 = 
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This takes approximate account of everything but the inclination 
of the Moon’s orbit to the ecliptic. The m 2 correction is often 
omitted in textbooks, but is of the same order as the e 2 correction. 

A much more complete orbital correction is given in [2]. With 
e' = 0.0167, e = 0.05490, m = 0.07480, we find 

(10) Pj = 1.00042 K x - 17". 38 per year, 

(11) P 2 = (1 + 0.00452 + 0.00280) K 2 = 1. 00732 K 2 = 38". 01 per year. 

P 2 now needs correction for the inclination i = 5°9'. We shall 
prove in working out nutation (see §XXII) that the factor is 
1 — (3/2) sin 2 i approximately. Thus let 

P 2= ( 1 + ^ e2_ l Sin2j + \ ^ 

(12) V ' 

= (1.00732 - 0.0121) K 2 = 0.9952 K 2 , 

or rather we shall take the more accurate value 
(13) P' 2 = 0.99537 K 2 = 37".56 per year. 

Then 

P = F\ -f P' 2 = 54". 94 per year. 

This is the most constant quantity connected with precession, and 
Newcomb proposed to call it the precessional constant. He gave its 
value as 54". 9066 per year, decreasing only 0". 0000364 per century. 
The difference of about 0".03 has been explained in §XI. Spencer 
Jones, in ([7], p. 16), gives 

p = (54". 93553 ± 0". 00145) per year. 

Note that c is slowly decreasing, 
n = Pc osfsint is slowly decreasing, 
k = P cost is slowly increasing, 

m = P cos 2 f — l is slowly increasing, partly because t is decreasing 
and partly because l is decreasing. 

At 1950.0 we have (for ^ = 1/81.3, (C — A)/C = 1/(305.5)) 

P x cose = 15".94 per year, 

P 2 cos € = 34". 87 per year, 

P 2 cos e = 34 ".46 per year, 

P cos e = 50". 40 per year (compare 50". 37 Newcomb). 


( 14 ) 
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XIV. Axis of figure and axis of rotation. A polhode and herpolhode 
argument shows that the difference in direction in steady preces- 
sion is about 

23.5 X 3600" 

= 0".009. 

25700 X 366 

XV. Nutation. The motion of the pole of the equator may be 
divided into a secular part (precession) and periodic parts 
(nutation). 

If at any time K is the pole of the ecliptic, P 0 is the mean pole 
of the equator and P is the true pole of the equator (see Figure 6) , 
we define the nutations as follows: 

(1) nutation in obliquity = Ae = KP — KP 0 , 

(2) nutation in longitude = A\p — P 0 KP, positive as shown; 
note that also equals y 0 y, positive as shown, and is equal to 
AX, the true minus mean longitude of any star. Nutation arises 
from various causes, and its complete determination is rather 
complicated. Much the largest part arises from the retrogression 
of the Moon’s nodes with period about 18.6 years; we shall 
investigate this, and consider briefly two other effects. 


K M' 



Figure 6. Motion of Pole of Equator 

The final results of complete investigations show that the 
largest terms are 

A« = 9". 21 cos $2 - 0".09cos2S2 + 0".55cos2 L + 0".09cos2(L , 

A* = - 17".23sinfi + 0".21sin2Q - l".27sin2L - 0".20sin2(l , 

where the largest neglected coefficients are about 0".02 in A< 
and 0".13 in A^. Here 
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12 = longitude of ascending node of Moon’s orbit on ecliptic, 

L = Sun’s mean longitude, 
d = Moon’s mean longitude. 

XVI. Orbit of the Moon. The period relative to either the 
fixed stars or the equinox is 27.322 days. The orbit is inclined 
at an average inclination of 5°9' (zh 9' on either side) to the 
ecliptic, and the nodes regress (not quite uniformly) in a period 
18.60 years relative to the fixed stars, or 18.61 years relative 
to the equinox. 

XVII. Precessional effects of Sun and Moon. The Sun and 
Moon, according to the theory of §XIII, if moving at any angle 
t to the equator, would set up an average motion of pole n 
= (P x + P 2 ) sin e cos e and precession in longitude k— (Pi + P2) cost, 
where P^Sun) = 17". 38 per year and P 2 (Moon) = 38" .01 per 
year are practically constant. At 1950.0 we have 

Pi cost = 15". 94 per year, P 2 cost = 34". 87 per year. 

In investigating nutation, we shall also deal with the correction 
of P 2 for inclination in evaluating precession. 

XVIII. Effect of Moon’s orbital inclination and nodal retrogression. 

In Figure 7, K is the pole of the ecliptic, M the pole of the Moon’s 
orbit, P the pole of the equator. M moves backwards around K 
in a circle of radius i = 5°9 / . Thus P describes a wavy path. Let 
6 — PKM = 180° — 12, where 12 = longitude of the ascending node 
of the Moon’s orbit on the ecliptic. 0 increases by 2w in 18.61 years. 
If 12 = N 0 — Nit, then approximately 

8 = N,= = 0.3376 rad/year. 

18.61 

As 18.61 years is long compared with a month, we may suppose 
that the Moon is giving P a velocity P 2 cos PM sin PM per- 
pendicular to PM. If £ and 77 are the small displacements of P 
along and perpendicular to KP y we have 

£ — P 2 cos PM sin PM sin KPM y 

77 = P 2 cos PM sin PM cos KPM. 

£ is the nutation in obliquity, A«; 77 gives a secular part (lunar 
precession) and a periodic part (Ai^sint)- 
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Figure 7 . Motion of Pole of Moon’s Orbit 

XIX. Nutation in obliquity. Using sin PM sin KPM = sin i sin 0, 
we have 


£ — P 2 sin i sin 0 cos PM 
= P 2 sin i sin 0 (cos t cos i + sin € sin i cos 0) 


— - P 2 cos e sin 2i sin 0 + ^ P 2 sin t sin 2 t sin 20. 
z z 

But 0 = N u so that integration gives 

, P 2 cos € sin 2 i P 2 sin € sin 2 i 

£ = ^7 cos 0 — cos 20 


2 N X 


4Ni 


N cos( 180 ° — 12) — - iVtant tan i cos( 360 ° — 212) ; 
4 


or 


At = IV cos 12 — - iV tan t tan i cos 212 , 
4 


where 


iV = constant of nutation 

P 2 cost sin 2i 34". 87 sin 10 ° 17 / 26 // 


2 N X 


2 X 0.3376 


= 9 ". 226 , 


and 


\ N tan t tan i = 0 ". 09 . 
4 
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Thus 

At = 9 " .23 cos 12- 0 " .09 cos 2 12, 

and this explains the major part of the nutation in obliquity 
due to the rotation of the Moon’s orbit. The observed coefficient 
of cos 12 is 9 ". 21 . 

XX. Polar motion in the direction of decreasing longitude. 


P 2 cos PM sin PM cos KPM 


= P 2 cos PM sin PM 


COS l — COS £ COS i 

sin t sin PM 


= P 2 (cos i cosec t cos PM — cot t cos 2 PM ) . 


So 

rj = P 2 cos i cosec e (cos £ cos i + sin £ sin i cos 0) 

— P 2 cot £ (cos £ cos i + sin £ sin i cos 0) 2 . 

Using cos 2 0 = (1 + cos 20 ) / 2 , ?? may be split up into a constant 
part (giving lunar precession) and periodic parts (giving nutation) . 

XXI. Periodic parts. 

77 = P 2 cos i sin i (1 - 2cos 2 Ocos0 — - P 2 cosf sin£ sin 2 i cos 20 


= — - P 2 cos2£sin2icos0 — - P 2 sin 2 £ sin 2 i cos 20. 
2 4 


Thus integration gives 

P 2 cos 2 £sin 2 / . P 2 sin 2 £sin 2 i . _ 

, sme - 8N| — - 2 » 

cos 2f 1 

= — N sin Q, + - N sin £ tan i sin 212 

cos£ 4 

= - 6".87sinl2 + 0".08sin2S2 

(using N = 9 r/ . 226 ). Using the observed iV = 9 ". 21, the coefficient 
of sin 12 is — 6 /r .86. Thus, neglecting the small terms in 212, the 
observed N gives 


£ = 9 /r .21 cos 12, 


j) = — 6".86sinl2, 
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so that the equatorial pole P describes an ellipse (shown in Figure 8) 
with major axis about 18". 4 along P 0 K and minor axis about 13". 7, 
around the mean pole P 0 , with period 18.61 years, in the sense 
indicated (remembering that 12 decreases) . 



Figure 8. Polar Nutation. 


The nutation in longitude is given by 

Ai/'sin* = the above periodic part of y, 


so that 


A\p = —2 N cot 2e sin 12 + \ N tan i sin 212 

4 

= — 17". 27 sin!2 + 0".21 sin2l2 

(using N=9".226). Using N = 9".21 gives 17". 24. This explains 
the major part of the nutation in longitude due to the rotation 
of the Moon’s orbit. 


XXII. Constant part (lunar precession). This should reduce to 
P 2 cosesine when i = 0 . The constant part is given by 

r j = P 2 cot € cos 2 / — P 2 cot f ( cos 2 e cos 2 / + - sin 2 e sin 2 / 


= P 2 costsine ^1 — ^ sin 2 / j = P 2 cosfsi 
= P 2 cos e sin e , 


sint- 


1 3 cos 2 / 


where 
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Pi = P 2 ( i - | sin 2 ?' j = 0.98794 P 2 = 0.98794 X 38" .01 

= 37". 55 per year. 

The factor 1 — (3/2)sin 2 i is needed in dealing with the lunar part 
of the precession (see §XIII). 

XXIII. Nutation due to variability of the solar couple during 
the year. The average solar couple (* sine cost, 0,0) of §IV results 
from averaging the actual couple 

(2Ksin 2 Lsint cost, — 2/csinLcosLsint,0), 

where we are supposing the Sun’s orbit circular and using L for 
the Sun’s longitude and 

* = 3GS(C- A)/2a'\ 

In discussing precession, we have taken account of the average 
couple; we have now to take account of the periodic couple 

( — k sin t cos t cos 2L, — k sin t sin 2 L) 

about Ox and Oy of Figure 2 respectively. Putting K x = k/Coj, 
these produce polar motions respectively 

— Ktsint cost cos 2L along Py and — X!sin€sin2L along KP . 

But in the notation of §XVIII, these are r\ and £ respectively, i.e., 

£ = — K 1 sinesin2L, 

7/ = — - Kisin2t cos2L. 

2 


If L = n’ = 27r/year, we have on integrating 

Ki sine y Xisin2f . „ r 

£ — — . — - cos 2 L = At, r? = 7 — sin 2 L = A^ sin < , 

2m 4n 

K,COS( . . 

A#= --2^~ sln2i - 

Using Xj = 17". 37/year, n' = 27r/year, we have 
At — £ — 0".55cos2L, 

A^sin* = ri = — 0".50sin2L, 

S\p = cosec ( = — l".27sin2L, 
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the coefficient of sin2L in — A\p being simply 1/2 ttX solar pre- 
cession in longitude in 6 months, or 7".97/2tt. 

This explains the six-monthly terms due to the Sun. 

XXIV. Nutation due to variability of lunar couple during the 
month. Neglecting the inclination of the lunar orbit to the ecliptic, 
we have quite similar results for the Moon. If n = 27 r/ 27 d .322 
= 13.37 X 27r/year, we have 


K 2 sine . 37". 73 sin 23°27' 

At = — cos 2(L = 


A\p = 


2 n 

K 2 cos e 
2 n 


sin 2d = 


13.37 X 4 t r 
37". 73 cos 23°27' 


13.37 X 4 t r 


cos 2d = 0" .09 cos 2d, 

sin 2d = — 0". 21 sin 2d, 


the coefficient of sin 2d in — A\p being simply 1/2tt X lunar pre- 
cession in longitude in (1/2) X 27.322 days. This explains the 
fortnightly terms due to the Moon. 

We have now derived all the terms given in §XV. 


XXV. Difference between axis of figure and axis of rotation. 

If the orientation of the Earth is specified by Eulerian angles, 
and these have been determined as functions of the time, the 
position of the axis of rotation at any time may easily be deduced 
(from Euler’s geometrical equations). Woolard in ([16], p. 159) 
finds that no nutation coefficient in £ or differs by more than 
0".0062 according to which axis is used; there is also a constant 
difference of 0".0087 in £ arising from precession (see §XIV); 
the axes may, however, differ in direction by up to 0".4 on account 
of variation of latitude. Federov in [l] uses the axis of angular 
momentum, which never differs by more than about (T.001 from 
the axis of rotation. 

The results in Table 1, to three decimals of a second of arc, 
appear to follow for 1900.0 from ([16], pp. 153 and 159); see 
also ([6], p. 78). 


XXVI. Comparison of theory and observations. The usual method 
has been to solve for the nutation constant N, all other nutation 
coefficients being regarded as known multiples of N. It appears 
from ( [5], p. 224) and [6] that the difference of one or two hundredths 
of a second of arc between the calculated (rigid body) and the 
observed values of N may be accounted for geophysically (some 
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cos 12 

cos 2Q 

cos2L 

cos 2d 

Rotation 

+ 9".210 

- 0".090 

+ 0".552 

+ 0".088 

Figure 

+ 9 .209 

- 0 .090 

+ 0 .555 

+ 0 .094 

Difference 

+ 0 .001 

0 

-0 .003 

- 0 .006 


n 

= Arsine 




sin 12 

sin 212 

sin2L 

sin 2d 

Rotation 

- 6" .858 

+ 0".083 

- 0".507 

- 0".081 

Figure 

- 6 .857 

+ 0 .083 

- 0 .510 

- 0 .087 

Difference 

- 0 .001 

0 

+ 0 .003 

+ 0 .006 


j? cosec t - 

= A\p (longitude) 



sin 12 

sin 2Q 

sin2L 

sin 2d 

Rotation 

- 17".233 

+ 0".209 

- 1".273 

- 0".204 

Figure 

- 17 .230 

+ 0 .209 

- 1 .280 

- 0 .220 

Difference 

- 0 .003 

0 

+ 0 .007 

+ 0 .016 

Earth models would allow a greater difference, and the actual 
difference is well within the bounds of possible explanation). For 
astronomical purposes, it seems sufficient to adopt the observed 


value of N. 

Federov in [l] (see also [6]) has determined the chief coefficients 
independently, and considered phases (by determining a cosine 
coefficient when rigid body theory predicts a sine term, and vice 
versa). He appears to find no clear difference from the usual 
ratios and phases which is of much importance for practical 
astronomical purposes. 
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On an Irreversible Dynamical System 
with Two Degrees of Freedom: 
the Restricted Problem 
of Three Bodies 


Introduction. The three series of lectures, which I delivered at the 
Yale Dynamical Astronomy Institutes of 1961, 1962 and 1963 
respectively on the restricted problem, dealt with three different 
faces of this celebrated problem in dynamics. 

In the 1961 series of lectures I set a rather ambitious goal and 
followed a qualitative approach . BirkhofFs and Poincare’s funda- 
mental contributions formed the basis of these lectures, which dealt 
with periodic motions, reducibility, regularization, regions of possible 
motions, etc. 

The 1962 lecture series concentrated on what might be termed the 
formalistic treatment of dynamical problems. The powerful tools of 
celestial mechanics, canonical transformations and variables were 
used and the pertinent equations of motion were derived. Performing 
regularizations within the framework of canonical transformations 
requires the introduction of the concept of the extended phase space. 
Once the method of canonical transformations is extended to this 
2n + 2 = 6 dimensional space, a great variety of forms of the equa- 
tions of the restricted problem are obtainable. As examples, 
Cartesian rectangular coordinates (synodic and sidereal), polar 
coordinates, Delaunay variables, Poincare’s variables, etc. were 
derived. 
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The third series of lectures, delivered in 1963 at Cornell University 
under the auspices of the American Mathematical Society, presented 
the quantitative results of the restricted problem. The numerical work 
of the famous Copenhagen school was used as a basis with which 
G. Darwin’s, F. Moulton’s, and V. Egorov’s results were compared. 
A large number of recent contributions were reviewed and the 
existence of certain families of orbits at various mass ratios was 
established. 

The trilogy described above followed the three approaches to 
dynamics. Because of the different nature of these investigations, 
their results are also substantially different and repetitions are easily 
avoidable. Nevertheless, all three are designed to gain a better 
understanding of the same nonintegrable, irreversible, two-degree- 
of- freedom dynamical system. 

The vast amount of material covered in the three lecture series is 
not suitable for condensation in a short article. A thorough treatise 
of the restricted problem with its applications will shortly appear in 
my book, Theory of orbits — The restricted problem of three bodies , 
Academic Press, New York (to appear in 1966). However, the 
present article should be useful to those readers who wish to become 
familiar with the basic ideas of the restricted problem and who wish 
to see its relation to the general problem of three bodies, to regular- 
ization, etc. A few typical results of the formalistic and qualitative 
approaches will be sketched. The considerable collection of references 
will enable the reader to pursue the subject further. 

Statement of the problem and equations of motion. The problem 
under discussion is the restricted problem of three bodies (probleme 
restreint). Two bodies (assumed to be point masses and called 
primaries) revolve around their center of mass in circular orbits 
under the influence of their mutual gravitational attraction. A third 
body (which is attracted by the previous two but is not influencing 
their motion) moves in the plane defined by the two revolving bodies. 
The problem is to determine the motion of this third body. 

Let the masses of the two primary bodies be and m 2 , their mean 
motion (angular velocity) n , and their distance l (see Figure 1, in 
which / = a + b) . Then 


( 1 ) 

where M = m x + m 2 . 


k 2 M = n 2 P 



152 ♦ VICTOR SZEBEHELY 

The center of mass of the system is located on the line connecting 
rrii and ra 2 , and its distances from m 2 and m l are respectively 

(2) a = — , b = 1CT- 

M M 

Taking the origin, 0, of a fixed inertial coordinate system (X, Y) 
at the mass center, and using r for time, the equations of motion 
referred to this system will be 

d 2 X_ dF d 2 Y _ dF 
~d7~JX' ~d?~dY' 

where F is Poincare’s “force function” or the negative potential 
energy and it is given by 

( 4 ) F-k'l^+a*) 

\P 1 P 2/ 

with p u and p 2 being the distances between the primaries and the 
third body. 

We note that the Hamiltonian, 

(5) 


2L V dr ) ' \ dr ) J * 

is not constant, since F depends explicitly on the time, 

F=F( x, Y, r). 



Figure 1 . The Restricted Problem in Fixed (X, Y) 
and Rotating (£, 77 ) Coordinate Systems 
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Introducing a uniformly rotating coordinate system (£,?/) with 
origin at the mass center, so that m x and m 2 are located on the £ axis 
with coordinates (6,0) and (— a,0), the equations of motion become 


( 6 ) 


where 


d\ 9 d v 

dr 2 ^dr 


/ i n 

d? +2n ~" 


di 

l dr 


dF* 

~W’ 

dF* 

di) 


(7) F* = F+in 2 (£ 2 +„ 2 ) 

and 

(8) p 2 =( ^_ 6) 2 +7?2j p | ==(f + a) 2 =?? 2 > 

The introduction of nondimensional quantities simplifies the equa- 
tions. Let 


* = £/l, y = i/l, = Pl /l , r 2 = P2 /l, 


t = nr, (i — — , and U = F*/l 2 n 2 . 

M 

The equations of motion in nondimensional form are 

d 2 x dy dtl 

2 — = 

dt 2 dt dx' 

(9) 

cry dx d£l 
dt 2 + dt dy' 

and we have the relations 

(10) — 

2 r 2 r x 

and 

(11) rf = (x - P ) 2 + y 2 , r 2 = (x + 1 - M ) 2 + y. 

Equations (9), (10) and (11) represent the problem in conven- 
tional nondimensional quantities. The corresponding physical 
picture is as follows: the two primary bodies are located on the x axis 
which rotates with unit angular velocity; the coordinates of the 
primaries are (^,0) and P 2 (/u — 1,0), and their masses are 1 — P 
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and n, with 0 ^ m = 1> the distance between the primaries, their 
total mass, their angular velocity and the gravitational constant are 
unity. 

The Jacobi integral is obtained by multiplying the first of equa- 
tions (9) by 2dx/dt and the second by 2 dy/dt, adding and integrating: 


( 12 ) 




= 2U - C, 


where C is the constant of integration. 

We note that according to Equations (9) the partial derivatives of 
Q completely determine the problem, therefore adding a constant to 
the expression for Q — as given by Equation (10) — will not change 
Equations (9), but will influence the value of C in Equation (12). 
A symmetrical form of il is obtained by adding the constant quantity 
m( 1 - m)/ 2 to Q. Let Q = 12 + ^*(1 — m), or 

(13) ft — - [(1 — m)^i + v- r i\ 4 I • 

2 r i r 2 

The new Jacobi constant C is related to the previous one by 
C = C + fx{ 1 — m) * 

The final set of equations, with the notations 

dil Oil . dx 

t y ^ j ? etc., 

dx 0y dt 

becomes: 

(14) x - 2y = il x , y + 2x = Q yi 

(15) ( x) 2 +(y) 2 =2n-C , 

where is given by Equation (13). 


Relationship to the general problem of three bodies. The general 
problem of three bodies is defined as follows: three particles attract 
each other according to the Newtonian law of gravitation, they are 
free to move in space and are initially moving in any given manner; 
find their subsequent motion. Their masses are m u m 2 , and m 3y and 
their position vectors from the origin of coordinates are r u r 2 , and r 3 
(see Figure 2). 


AN IRREVERSIBLE DYNAMICAL SYSTEM 


155 


ri2 



Figure 2. The General Problem of Three Bodies 


The potential function of the system is 

T/ l 2 / m i m 2 . m 2 m 3 mgmA 

<16) v ~" i (.m + m + m)- 

The equations of motion can be written as: 

(17) m,r7=-^ i'= 1,2,3, 

OTi 

where the right-hand sides represent the gradients of the potential 
in the directions of the position vectors. These equations can also be 
written as: 




(18) 




r 3 = — k z m l 


~ r i 

i^- ^i 3 


■f k 2 m 2 


r 2 — r 3 

I r 2 - 


Equations (18) describe the general case of the problem of three 
bodies with Newtonian gravitational forces. The structure of these 
equations is of some interest inasmuch as the masses m u m 2 , and m 3 
are missing from the first, second, and third equations respectively. 
This fact does not “uncouple” the equations since all three position 
vectors occur in all three equations. 



156 


VICTOR 5ZEBEHELY 


The terms appearing on the right-hand side have specific physical 
significance. The first term on the right side of the first equation, 
for instance, represents the force per unit mass acting on the first 
body due to the presence of the second body. The second term on the 
right side of the same equation represents the force per unit mass 
acting on the first body due to the presence of the third body. 

Decreasing the mass of the third body will reduce its influence on 
the motion of m x and m 2 , i.e., as m 3 — >0 the first two of Equations 
(18) become 


(19) 


r\ = — k 2 m z 


r i - r 2 

Iri-riV 


r~ 2 = — k l m x 


r 2 ~ r i 
kl-rT’ 


while the third equation will not change. This step does uncouple the 
equations since the motion of m x and of m 2 can now be determined 
without considering the effect of the third mass by solving the 12th 
order system of differential equations (19). 

The third equation of (18) requires comment since if in the first 
two equations m 3 = 0 (which gives Equations (19)) then the third 
equation should be 0 = 0. This is because the usual derivation of the 
third equation of (18) from the third equation of (17) by dividing 
through by m 3 is not permitted when m 3 — 0. Continuing to use the 
third equation of (18) constitutes the approximation which creates 
the restricted problem of three bodies. In effect, the assumption is 
made that ra 3 ^ 0 but that m 3 is sufficiently small so that it does not 
effect the motion of m x and of m 2 . That is, Equations (19) are 
approximate while the third equation of (18) is exact. The system 
of equations consisting of Equations (19) and of the third equation 
of (18) represent the true dynamical system only approximately; 
the degree of approximation is given by the “smallness” of the terms 


- r x 
m 3r = — =r a 

as compared to the terms 

r~x~ r* 


m 2 


In-r!| a 


and 


and 


m 3 


r* 2 - r 3 


r i - r 2 

m if= 

ki- r 2 


respectively. 
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The effects of the first two bodies on the motion of the third is 
given by the third equation of (18). Accepting the above-mentioned 
approximation, one might solve Equations (19), substitute the solu- 
tions into the third equation of (18), and obtain the sixth order 
differential equation for the motion of the third body: 


(20) 


r 3 = —k^rrix 


r 3 ~ r i(0 
|rl-r-(f )| 3 


+ k 1 m 2 


r 2 (0 ~ r s 
- ^| 3 ’ 


where now f\{t) and F 2 {t) are given functions of the time and of the 
initial conditions, and F% is to be determined as a function of time. 

Equation (20) describes the restricted problem of three bodies. 
The “restriction” is equivalent to the assumption according to which 
the motion of the first two bodies is not influenced by the third, while 
the motion of the third is determined by the masses and by the 
motions of the first two. 

Equation (20) can be generalized to 

(21) ^/K^r'W,^),^), 


where F z is the only unknown function of time, m u and m 2 are given 
constants and F^t) and r^(£) are also given as functions of the time 
and of their initial conditions. 

The function /in Equation (21) represents the force field which in 
Equation (20) was the Newtonian gravitational field. In addition to 
Equation (21) the initial conditions of the third body will be needed, 
in order to determine its motion. 

The classification of the various forms of the restricted problem 
follows from the above remarks and is based on Equation (21). 

1. Depending on the force law, /, we speak of Newtonian and 
non-Newtonian problems. 

2. Depending on the initial conditions for r[ and F 2 , in the New- 
tonian case, we speak of the circular or of the general- conic- section 
restricted problems. 

3. Depending on the initial conditions of the third body we dis- 
tinguish between the planar and the three-dimensional restricted 
problems. In a Newtonian gravitational field the first two bodies 
(primaries or principal bodies) will always move in a plane. If the 
third body’s initial velocity vector is in the plane determined by the 
orbits of the primaries, it will stay in this plane; this is the case in 
which Equations (14) and (15) are applicable. 
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4. Further classification is possible by specifying the m^/m^ ratio. 
The Copenhagen restricted problem, for instance, is distinguished by 
using unity for this ratio. 

Reductions of the order of differential equations of motion. The 

fourth order system of Equations (14) can be reduced to a third 
order system by making use of the Jacobi integral, Equation (15). 
One way to accomplish this is by introducing the angle z between 
the tangent to the orbit and the positive x axis as a new dependent 
variable: 

(22) z = arc tan -y - . 

ax 

We will show that the equations of motion become: 



x = K(x,y)cosz, 

(23) 

y = A(x,y)sin 2 , 


Z = — 2 — A^sinz + A y cos2, 

where 


(24) 

A = (2J2- 0 1/2 . 


The proof of the first two equations of (23) is as follows. If s 
denotes the arc length, 

dx dy 

— = COS 2, — = Sin 2, 

as as 

and so 

dx 

x = — s = A cos 2 , y = A sin 2 , 
as 

since the absolute value of the velocity vector s is obtained from the 
Jacobi integral as 

(J) 2 = ( X ) 2 + (y) 2 = 212 - C = A 2 . 

The proof of the third equation of (23) requires the evaluation of 
2 from Equation (22): 

. _ yx - yx 
2 ~ ( s )* ■ 


AN IRREVERSIBLE DYNAMICAL SYSTEM 


159 


Using Equations (14) to eliminate x and y , and the first two equa- 
tions of (23) to eliminate x and y, we obtain the desired result. 

W e note that an alternate approach would be to use the first two 
equations of (23) as definitions of a transformation without reference 
to Equation (22). In this way we have x = A cos 2, y = A sin 2 and 
consequently x = A cos 2 — A2 sin 2 and y = A sin 2+ A z cos 2, from 
which A2 can be obtained; A z = y cos a — x sin 2. By the same elimina- 
tion process as before we can obtain the third equation of (23). 

Equations (23) represent the third-order version of the problem in 
the form of three first-order differential equations with x,y, z as 
dependent and t as the independent variable. The significant fact is 
noted that these equations can be written as 

* = 4>(x,y,z) 9 

(25) y=f(x,y,z) 9 

z= x(x,y,z). 

That is, the right-hand members do not contain the time. This fact 
allows a physical interpretation of the equations by an analogy and 
also assures further reduction of the order by elimination of the time. 

Consider a flow field with velocity vector v~= v{F t t ), where r~is the 
position vector and t is the time. This velocity vector gives a descrip- 
tion of the flow field since at every point r”in the field, at any time 
t, the velocity can be evaluated — excepting singular points. A flow 
field is called steady if 



i.e., if none of the velocity components depend explicitly on the time. 
The velocity components defined by Equations (25) can be inter- 
preted therefore as the description of a steady flow field. 

The continuity equation of hydrodynamics is 

^7 + div(piJ) = 0 . 

Ot 

where p is the fluid density. For an incompressible fluid p = constant 
and the continuity equation becomes 

div iT= 0 , 

or using the notation of Equations (25) 
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<t>x + *Py + Xz — 0 . 

Since the <p, \p, x velocity components, as given by Equations (23) 
satisfy this equation, the dynamical problem is analogous to the 
three-dimensional steady flow of an incompressible fluid. 

It should be remarked that the flow is not a potential flow since 

curliT ^ 0; 

direct computation of the six partial derivatives (f> y , <i>z> 'Px* *Pz> Xx> Xy 

from Equations (23) will establish this. 

The function A contains C, therefore Equations (23) will deter- 
mine a flow field for each given C value. By changing the constant 
of integration, C, the streamline picture will change. For a given C, 
Equations (23) will determine the totality of motions of the dynam- 
ical system and also the corresponding streamlines, providing that 
the inequality. 

{ft) - 2a(x ' y) - c * 0 

is satisfied. 

The streamline representation is singular when A = 0 or when 
A— > oo. The first case corresponds to zero velocity, the second to 
collisions at Pi or P 2 . 

Poincare’s original flow analogy is similar to but not identical in 
detail with the one just given, and should be mentioned also. Let 
x = x u y = x 2 , Jt = x and y = x 4 . Then the equations of motion can 
be written as 

*1 = * 3 , 


x 2 = x 4 , 


~ 2x 4 + 


5Q(jh: 1 ,X2) 


- 2x 3 + 


dx x 

dQ(x l9 x 2 ) 
dx 9 


or 

Xi = FXx i,x 2 ,x 3 ,x 4 ), i = 1, • • -,4. 

It can be seen that divi; ’= 0, i.e., 
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Y d JH 

h dxt 


= 0 , 


so we are dealing with the four-dimensional, steady stream-line flow 
of an incompressible fluid. The actual motion of the particle corre- 
sponds to those streamlines which lie on the 

*3 + *4 “ 2Q ( x l9 *2) + C = 0 


three-dimensional hypersurface. 
Equations (25) can be written as 


dx dy dz 

<f> $ x ’ 


from which 


(27) 


dz = x 
dx <j> ’ 


On the other hand Equation (22) gives 


(28) 


dz 

dx 


— arc tan y' = 
dx J 


y" 

i + (y') 2 ’ 


where y' = dy/dx . 

Substituting for x and <j> in Equation (27) their expressions as 
given by Equations (23), eliminating 2 by Equation (22) and 
equating the right sides of Equations (27) and (28) results in a 
second- order differential equation describing the dynamical system 
(excepting at points where the transformations are singular): 

(29) y" = 1+ t (y) [A y — A y ± 2(1 + (y') 2 ) 1 ' 2 ]. 

A 


It should be noted that elimination of time and use of the Jacobi 
integral can be combined and the above second-order differential 
equation can be obtained in a single step directly from the original 
fourth-order system. The general solution of (29) will contain the 
following three constants of integration: C which is included in A 
and the two constants which enter when (29) is integrated. The 
complete solution of the original fourth- order system requires the 
determination of the time dependence of the variables. This process 
will result in the fourth integration constant. To establish the time 
dependence, we write the Jacobi integral as 
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(*) 2 [1 + (y') 2 } = A 2 , 

where the y = xy' relation was used. The time is evaluated from 
the last equation as 

Here y' and A(x,y) are functions of x only since y{x) has been 
obtained from Equation (29). 

Regularization of the restricted problem. The main purpose of this 
section is to show how regularization is performed within the frame- 
work of canonical variables utilizing the concept of the extended 
phase space. We will conduct the discussion on a generalized level so 
that the reader can use this section as his starting point for the 
solution of new problems. 

The treatment starts with the Hamiltonian in a rotating rectang- 
ular Cartesian coordinate system: 

(30) H = ^ {pi + p£) + q 2 P\ - Q\P2 - F(Qu<h), 

where < 71 , q 2 are uniformly rotating Cartesian rectangular coordinates 
(corresponding to x and y of Equation (14)), and pi, P 2 are fhe con- 
jugate momenta (Pi = x — y } p 2 = y + x). We also recall that the 
gravitational part of il is 



where 

(31) ri = (qi - n)' 2 + ql and rj = (q 1 + 1 - n) 2 + qi 

The generating function to be used is a so-called extended point 
transformation : 


(32) W 3 = p,/(Q„ Q 2 ) + P 2 g(Qu Q 2 ), 

where f and g are conjugate harmonic functions of the new coordin- 
ates Qi and Q 2 - 

The corresponding transformation equations are 


(33) 


dW * A 

q, = — and 

dp, 


P,= 


dW 3 

dQi’ 


1,2. 
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First the coordinate tranformation is performed in the phase space 
then the transformation in the extended phase space is executed. 
Equations (32) and (33) give 

p i + pi = jj(Pi + Pl), 

where 


D(QuQ 2 ) = 



2 

+ 



2 


The new Hamiltonian also requires the computation of the q 2 p i 
— p 2 q x term, which is obtained by substitution: 

M. Pl W* ifZ +gi) ~ P2 Jq} fi + gt) 

Therefore the new Hamiltonian becomes 


,34, H - jL[pi+ n+ (p, 4 ■ -P.4) <*‘ + n - m„ Qi_ , 

where in computing F we write / in place of q x and g in place of q 2 in 
Equations (31). 

The equations of motion are 


(35) 


(36) 


£>. 

It 

§1- 

2P 1 + ^ i (f 2 + g 2 ) 

^ H2 

1 

r s , , i 

Qz = 2 D 

2P 2 -- ? ,(f 2 + g 2 ) 

L ohri J 


■ dH 


1_ ~Wi’ 


. dH 




Inspecting the last term of H we realize that Equations (36) have 
singularities at the primaries. 

In the extended phase space we have the Hamiltonian 


r = P 3 + 


2D 

-WuQJ, 


pi+n+ 



(37) 
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and the equations of motion are reducible to Equations (35) and (36) . 
We now introduce r* = Dr and obtain 

(38) r* - DP, + J[ Pi + Pi + (p,j4 ■ - *4) (f + " Dl 

and with this the equations of motion using the new time variable 
(t) can be established. The relation between dt and dt is dt = Ddt 
since T* = Dr. Denoting derivatives with respect to dt by primes, 
we have 


Q'.= 


ar* 

3P? 


p;= - 


ar* 

M’ 


i = 1,2,3, 


or 


(39) 


^- p ' + \k if+s% 


«-D. 

The last equation is dt = Ddt as expected. Introducing 

<t>(Qu Qz) — f(Q\, Qz) + is(Q\i Q2), 


and 


| 2 = f + 8 2 , 


the second set of the Hamiltonian equations of motion becomes 


pr _ _ p dP _ I p d _ p | , 1 2 , d ( pp\ 

1 Ps aQ, 2l P, dQ l aQ, P *dQU 01 + aQ, ’ 


(40) 


P£ = -P*T^- 


PJ = 0. 


aQ* 2L Pl aQl 2 aQ 2 aQj kl 




The last equation expresses the fact that H = constant. The singu- 
larity problem appears in the last term of the first two equations of 
the system (40) and it can be solved by selecting the proper D func- 
tion, i.e., specifying / or g. 

Equations (39) and (40) represent a sixth-order system, or 
omitting the last equation in both groups, we have a fourth-order 
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system of equations which is equivalent with Equations (14). To 
show this the reader will have to compute Q" and Q" from Equations 
(39) and substitute in the resulting equations the values of P[ and 
P' 2 as given by Equations (40). The two second-order equations 
obtained this way are 


(41) 


« + 2 D «- 4 D G |4l ‘- p,+? )' 


F or further comparison we recall that | (f> | 2 = q\ + q'i — r z by 
Equations (33). From this it follows that 


-|0| 2 — P 3 + F= Q — ~fx( 1 — fi) + H, 


or 


(42) 



F*3 T E = ft — 


Substituting Equation (42) into (41) gives 


(43) 


Qf - 2D «-4 D ( a -f)- 

«+ 2D «=4°( a -f)- 


Finally, using u for Q u v for Q 2 » Qz) for D, and il* for 
D (12— C/2), we have 


(44) 


u" - 2\v' = 
u"+ 2Xa 7 = Q* f 


a special case of which is Equation (14), viz. X = 1. 

Generalization of the problem. Equations (44) represent the re- 
stricted problem in the (u, v, t) system which is obtainable from the 
(jc, y, t) set of variables by the relations given in the previous chapter. 
It is essential to note that the type of regularizing transformations 
introduced does not change the general form of the equations. This 
remarkable invariance suggests the importance of the equations of 
the type given by (44). But one might go even further in the general- 
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ization of the problem. The class of dynamical problems described by 
the Lagrangian 

(45) L = ^(qi + ql) +qMq u q 2 ) +^(9i,9 2 ) + y(<7i,<7 2 ) 

leads also to equations like (44), since the equations of motion will 
contain only certain combinations of the three functions a,p, 7 ; i.e., 
the dynamics of the system can be represented by two functions 
only — corresponding to X and il*. 

Another example leading to Equations (44) is the elliptic restricted 
problem, also called the semirestricted problem. Here the primaries 
move on elliptic orbits instead of circles. By selecting proper vari- 
ables this dynamical problem is also reducible to equations identical 
in form to Equations (44), with the important difference that the 
dependent variable appears explicitly in il*. 

Generalization of the variables which describe the dynamical 
system and generalization of the dynamical system itself leaves 
Equations (44) invariant, as I have demonstrated with some 
examples above. Equations (44) describe the conventional probleme 
restreint in a Cartesian rectangular synodic coordinate system with 
X = 1 , with appropriate il and with the time as the independent 
variable. If X = \(u,v) and the time is transformed as described in 
the previous chapter, we obtain the regularized equations of motion. 
With X ^ 1 and with the true anomaly as the independent variable, 
the elliptic problem is represented. Finally with X = — «</ 2 ) 

Equations (44) represent all dynamical systems whose Lagrangian is 
given by Equation (45). 

The general importance of Equations (44) in dynamics was recog- 
nized by Poincare and by G. D. Birkhoff. The presence of the first- 
derivative terms (referring to either Equations (44) or (14)) render 
these equations irreversible since changing the sign of the indepen- 
dent variable does not preserve the equations. The structure of the 
equations is such that as long as the independent variable does not 
appear explicitly in U, an integral of the system is always available 
for any X function. This is not surprising since the appearance of the 
X function is not an essential part of the dynamical description of the 
system. The major difficulty of the dynamical system originates 
partly from the fact that X ^ 0 and partly from the complexity of the 
il (or S 2 *) function. 
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If the problem is changed into the discussion of a single-degree-of- 
freedom dynamical system, the situation is altered completely and 
one gets x = f(x), an unquestionably uninteresting equation. 

The essential features of two-degrees-of-freedom irreversible 
dynamical systems are therefore contained in the equations 

x - 2 y= tt x , 

y + 2x ~ 

The fact that these equations also describe the restricted problem 
of three bodies is rather significant since the restricted problem 
occupies a critical place in celestial mechanics being the simplest 
nonintegrable problem of definite physical importance. 
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Problems of 
Stellar Dynamics 


I. Introduction. Stellar dynamics is a relatively recent branch 
of dynamical astronomy. It has been developed mainly in the 
present century as a part of astrophysics. Many authors have 
considered dynamical problems in their study of stellar clusters, 
the Galaxy and the other galaxies, or clusters of galaxies. 

However there are some books devoted exclusively to stellar 
dynamics as a special discipline. 

Smart’s book, Stellar dynamics [l], deals mainly with the kine- 
matical aspects of stellar dynamics, namely the solar motion, the 
two star-streams and the ellipsoidal theory of stellar velocities, the 
statistics of the distribution of stars around the sun, etc. Only a 
minor part is devoted to the dynamical problems of stellar clusters 
and of the Galaxy. However this book includes a lot of useful 
material. 

A classical book in stellar dynamics is Chandrasekhar’s Principles 
of stellar dynamics [2], Chandrasekhar’s book is mainly devoted 
to his own work on stellar systems. He finds first a complete formula 
for the time of relaxation of a stellar system, and then he discusses 
at length the quadratic solutions of Liouville’s equation. Some 
applications of his work on the dynamics of the clusters and the 
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Galaxy are very interesting. The Dover edition of this book contains 
also Chandrasekhar’s discussion of dynamical friction and an ex- 
position of his views concerning the statistical mechanics of 
stellar systems. 

A broadly similar pattern is used by von der Pahlen in his 
Einfuhrung in die Dynamik von Sternsystemen [3]. However, 
von der Pahlen insists more on the observational part of stellar 
dynamics. 

A rather different approach is due to Kurth, An introduction 
to the dynamics of stellar systems [4]. Kurth discusses the three 
main approaches to the problems of stellar dynamics, the n-body 
problem approach, the continuum approach, and the statistical 
approach. He gives a number of interesting theorems concerning 
the first two approaches (some of them not well known before), 
but he disregards completely the third approach, which he con- 
siders as impossible. 

Galactic dynamics has been considered by B. Lindblad in a 
very good review paper in the Handbuch der Physik [5]. A rather 
short review, Recent developments in stellar dynamics , is given by 
Camm in Vistas in Astronomy [6]. 

Two Russian books on this subject have appeared lately. One 
is Ogorodnikov’s The dynamics of stellar systems [7]; this is to be 
published in English by Pergamon Press. It deals with the kine- 
matics and dynamics of stellar systems in general and of the 
Galaxy in particular. 

The second book is Idlis’ Structure and dynamics of stellar 
systems [8]. It deals with the dynamics of the Galaxy and of the 
other galaxies, considered as stationary stellar systems. 

In general all these authors consider stellar dynamics quite 
separately from classical celestial mechanics. However, both 
stellar dynamics and celestial mechanics are parts of dynamical 
astronomy. Their subject matter is essentially the same the 
gravitational interaction of a number of material bodies — and 
some of their methods are similar. 

Of course there is one important difference between them. The 
number of gravitating bodies in the case of stellar dynamics is much 
greater than in celestial mechanics, and this introduces a number 
of new and very difficult problems. In order to discuss these new 
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problems, new methods have to be introduced in many cases. 
However, there are many cases where stellar dynamics can profit 
from the methods and experience of the classical celestial mechanics 
and, likewise, methods developed in stellar dynamics can be 
applied to problems of the solar system. 

This interchange of ideas has not been done to a satisfactory 
degree until now. It would be profitable to bring these two 
disciplines much more closely together into a unified whole. 

The present chapter contains essentially my lectures at the Y ale 
University Observatory in 1962. It deals with some of the major 
problems in this field and indicates the points where research is 
under way or should be desirable. Further it gives the necessary 
references until 1962, or 1963 in some cases. 

Three different techniques have been used in most work in stellar 
dynamics. Our treatment will discuss these more or less separately, 
and we now indicate what the three are. 

(a) The n-body problem approach . Any stellar system is com- 
posed of n gravitating bodies, interacting according to the law of 
Newton. The equations of motion can be solved numerically if the 
number of stars is small, and this often gives an adequate solution 
of the problem. Such integrations have been done lately by S. von 
Horner [9], [10] and have given interesting results. 

However, if the number of the stars is great, even the most 
powerful electronic computers are unable to calculate their motions. 
On the other hand one can find a number of general theorems about 
a system of n bodies that can give us some information about the 
behavior of the system in general. Such theorems are the virial 
theorem and the theorems of Poincare and Hopf. Unfortunately 
no complete description of the behavior and the evolution of stellar 
systems can be effected by the available theorems. Therefore the 
most accurate method of dealing with the problems of stellar dy- 
namics is, in general, insufficient. 

(b) The continuum approach. When the number of stars in a stellar 
system is big enough we may consider the system as a gravitating 
continuum, i.e., we may consider the smoothed out gravitational 
field that we would have if the stars were pulverized and their 
matter distributed evenly in the interstellar space. More accurately, 
we consider that the phase space is evenly filled by points repre- 
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senting the positions and velocities of the stars in the actual space. 

This procedure is justified as regards the action of the distant 
stars on any single star of a stellar system; because in that case 
the potential of a great number of stars is almost the same as that 
of a gravitating continuum. However, this approach neglects the 
interaction of a star with its neighbors. In a close approach of two 
stars, their motion is influenced mainly by each other and the action 
of the distant stars may be considered only as a perturbation. This 
is the main limitation of the continuum approach. 

However, in most stellar systems the time of relaxation is very 
great, so that in the mean the close encounters have a relatively 
small effect on the dynamics of the stars. In general a star can 
make at least some revolutions (sometimes a great number of revolu- 
tions) around the center of the system before its orbit is changed 
appreciably because of close encounters with other stars. 

(c) The statistical approach. In most cases statistical methods are 
used in stellar dynamics. The calculation of the time of relaxation 
or of the mean free path of a stellar system, and the study of the 
distribution of velocities or of the density distribution of a cluster, 
require such methods. It is true that this approach has one basic 
difficulty; a consistent definition of probability is, in most cases, 
not given. However the statistical methods have given many in- 
teresting results about stellar systems in general. The verification 
of these results is one of the main problems of stellar dynamics 
at the present moment. 

II. The n-body problem approach. 

1. The Virial Theorem. The classical virial theorem is due to 
Lagrange and Jacobi. Lagrange was the first to apply it in the 
problem of three bodies; Jacobi applied it to the n-body problem. 

The equations of motion of the n-body problem are 
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(2) 


G 


r 

^ i' = l ;'= 1 r \ 


m ir rtij 


(i*j) 


is the total potential energy of the system and r tj is the distance 
between the stars i and j. 

Let the distances of the n bodies from their center of mass be 
then the moment of inertia about the center of mass is 

(3) J =^rriif?. 

r = l 


By differentiating it twice we find 

(4) i j ='i^ni i f?+J^miF i r i . 
^ 1=1 1=1 

But 

(5) Y* m iri =< ZT, 

i—\ 


where T is the kinetic energy of the system with respect to its 
center of mass. 

Further 


Z m.r.ri = X rriiiXiXi + y t y t + 2,2,) 


(6) 


1= 1 


" (dV dV dV \ 

— ~ , 1 ~ — x i + — — yi + — — % J 

i=1 \dXi dyi dZi / 


and this last quantity is equal to V , because V is homogeneous 
of degree — 1 with respect to x if y if z t . 

Hence 

(7) ij=2T+V, 

and as T + V = E (the total energy), we get 

(8) i J = E + T = 2E - V. 


This is the Lagrange equality. 
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If a system is stationary, so that its form does not change in 
the mean, then J = 0 and 

(9) 2T+V=0. 

This equation is usually called the Virial Theorem. A more general 
form of the same equation is 

(10) (2T) + (V) = 0, 

where the symbol ( ) means time average for f— > This equation 

is valid when a system remains bounded. It would be valid also 
in a system expanding slowly, so that J — > const. (J— >0). But such 
a case is probably exceptional. 

Some important consequences can be drawn from the Lagrange 
equation. If E > 0, then ^j>E>0. 

Hence J>2Et + c and J>Et 2 + ct + c where c and c' are 
constants. Therefore J — > °° as ^ — > °° , and if m is the greatest 
mass of any star, 

n 

( 11 ) rh'Z.rf^J; 

1-1 

i.e., the distance of the representative point (x„y,, 2 ,) (in the space 
of 3 n dimensions) from the origin tends to infinity. 



Figure 1. Oscillations of Two Stars 
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This does not necessarily mean that one star will actually escape 
from the system; in fact, although the sum of the squares of the 
distances of all the stars from the center of mass increases con- 
tinuously, the individual distances may oscillate in such a way 
that no star actually escapes from the system. This fact is illustrated 
by Figure 1, where the distances of two stars from the center of 
mass oscillate alternatively with increasing oscillations. In real sys- 
tems once a star has gone very far, it comes under the influence 
of other fields and can be considered as lost from the original system. 
However it would be of great theoretical interest to determine 
under what conditions oscillations of the form of Figure 1 are pos- 
sible in an isolated cluster. 

It has been assumed by some authors that the escape of stars 
from a cluster must be effected in pairs or even triples, in order 
that the center of mass should not move. However this is not neces- 
sary. It is most probable that only one star is ejected at a time, 
while the rest of the cluster recedes with respect to the center of 
mass so that the equalities 

n n n 

'Z,m i x i ='£ niiyi = £ m.z, = 0 
i = 1 i = 1 1=1 

persist. 

If E > 0, and one star is ejected, it is most probable that the 
energy of the remaining cluster with respect to its new center of 
mass is again positive; therefore another star also will probably 
be ejected, and so on. It is for this reason that stellar systems of 
positive energy are disintegrating. Such systems are the associa- 
tions which are expanding with rather big velocities and disintegrate 
completely in some millions of years. 

If E < 0 there is almost no known criterion to decide whether 
a stellar system will eject stars or not. A partial answer to this 
problem is given by Hopfs theorem, that is mentioned below. 
Further work on this problem has been done by Chazy [l 1 ] , [12], 
[13] in the case of the three-body problem, and by the Russian 
authors Khilmi, Merman, Schmidt, etc. A book by Khilmi, Quali- 
tative methods in the many body problem , has recently been translated 
into English and German [14]. One can find there references to the 
Russian literature on the n-body problem. Further a Review of 
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Soviet celestial mechanics literature including a great amount of 
material has been published by the U.S. Department of Com- 
merce [15]. 

A number of systematic generalizations of the classical virial 
theorem have been given lately by Chandrasekhar [16], [17]. 
These are the tensor virial theorem, and the virial theorems of 
order higher than the second. The virial theorem for the ^-bodies 
in the post-Newtonian approximation of general relativity has 
been considered by Chandrasekhar and Contopoulos [18]. 

If we multiply the first equation of the system (1) by y t and 
add all the terms with different i we get: 


( 12 ) 


" n a V 

'Lm t x i y i = 

i= 1 i = 1 ax i 


or 


v-' ^ / • \ v' Grrij(Xi Xj) 

Z mi — (xiyi) - Z MiXiyi = - Z m .>'* Z 3 

i=i dt i-i 


i = l 


(13) 


ly y> Gm.nijix, - x ; ) ( y, - y>) 

( = 1 i ' IJ 


If we write 

1 " " Gmitnjixi — Xj)(yi- yj) _ y 

(14) - 5 Z Z r » “ Vl >- 

Z i= 1 f ij 

and 

1 n 

(15) - '£m l x l y l = 7^, 

Z i = i 

then 

« d 

(16) 27^+ T.m i -^(x i y i ). 

and are the potential energy tensor and the kinetic energy 
tensor respectively. 

In general if we multiply the first equation of the system (1) 
by x°y-z c i and add the corresponding terms we find 
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(17) 


or 


Z m &iXiyUi = - Z 

£1 TTi d*; 


iz m ‘ x ‘ x '‘y^ z i - a Z »».•*?*? V.-zf 

dt I=1 I=1 


- bY^m i x i y i x'iy b i c^m i x i z i x a i y b i z e i 1 


(18) 


= -z z 

i = l 


Gm i m J x‘iy\z c i( x i - x,) 


1 ^ ^ Gm l m I {x tt i y b l z1 - tfyfzj) (x , - xj) 

o „3 

Z 1 = 1 ; = 1; 


i;;V* 


But if we set 

(19) Xi = x, + (Xi - xj) 9 y t = y ; + (y,-- y>), 2 , = e> + te* - 
we find 

*“y?z' - XjyjZ, 

( 20 ) a- 1 6-1 c-1 

= Z Z Z c“C£c r c xr p yMzr(x, - ^) p (y, - %)*(* - *,) r . 

p=l g— 1 r=l 

Thus if we write 

(2D tm i i*yi£i*iy!z!-T* rM 

1 = 1 

and 


/I n 


( 22 ) - 1 z z am,mj(x ‘ ~ ~ ~ = v 

2 ;=i rfj 

we find 


abc;defj 


il 

7'm.abc = a T-2M,a- 1,6c + 67\lO;a,6-l,c + cT^oijokc-l 


+ Z Z Z C P “C*C; ^P+l,9,r;a-M“9,c-r* 

p = l 9 =i r= 1 


(23) 
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Figure 2. Integration Element in a Star Cluster 


(a) Applications of the Virial Theorem . A well-known application 
of the classical virial theorem is for the determination of the mass 
of a cluster in a stationary state. If we define a mean square velocity 
by the formula T = Mu 2 / 2 where M is the total mass, Equation 
(9) for a system of stars of equal masses gives 


(24) 


Mir 


fz E 

Z i-1 


Q n ^ n 


1) m 2 


GM 2 



where r* is defined by 


(25) 


n(n — 1) 


-in n 


1 


We call r* the radius of the cluster; this is defined accurately 
enough in the case of spherical clusters by a method devised by 
M. Schwarzschild [19] which is based on star counts on a photograph 
of the cluster( 1 ). 

Let the number of star images between two parallel straight lines 
whose distances from the center are q and q + dq be S(q)dq (Figure 
2). These images are the projections of all the stars between two 
parallel planes. On each plane the curves of constant density p are 
circles of radius z = (r 2 — q 2 ) 1/2 . Thus 


( L ) Chandrasekhar, Kurth and others give a radius R* that is approximately 
equal to one half of r*. 
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(26) 


S(q)dq = I p2ivzdzdq. 
o 


But we have zdz = rdr ; therefore 


(27) S(q) = £ p2*rdr. 

Instead of <» we may set as upper limit a visible boundary of 
the cluster, say 

The gravitational potential of the cluster is 

(28) V = - f " 4wr 2 p dr =2 G f M(q) dS(q), 

Jo r Jo 

because 

(29) dS(q) = — p(q) 2?r qdq. 

Hence 

(30) V = [2 GM(q) S(q) ] 0 “ - 2 G j g S(q) 4*q 2 pdq. 

The first term is zero because M( 0) = 0 and S(°o) = 0 (at the 
boundary). Therefore 


(31) 


V=-4 g£ qS(q)dS(q) 

= [2GqS 2 (q)]o - 2 G f g ~S 2 (q)dq 
= - 2 gJ o S 2 (q) dq. 


On the other hand 


( 32 ) 


/> oo OO 

M= l 4 W r 2 pdr= — 2 I qdS(q) 
= [-2qS(q)]o+2j o S(q)dq 
= 2 J g S(q)dq. 
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V = 


GM 2 


r 


* 


gives 

(34) 


2 (jrSdq) * 
Jo S 2 dq 


From equation (24) we get 
(35) M = 


r*~iF 

~G 




3 r*vi 
~G~ 


where (i^) 12 is the root mean square radial velocity, that is 
measured spectroscopically. This method has been used success- 
fully for finding the masses of the globular clusters. 

The same method has been used lately to find the masses of 
clusters of galaxies. There, however, a serious discrepancy has been 
observed. If M is the mass of the cluster and n the number of the 
galaxies, the mass of each galaxy is calculated from the formula 
m = M/n. On the other hand the masses of the individual galaxies 
can be found by measuring spectroscopically the rotational velocity 
of each galaxy with respect to its own center of mass. However the 
masses found by the latter method are much smaller than the 
masses calculated by means of the virial theorem. Two main ex- 
planations of this discrepancy have been proposed. One is that the 
virial theorem is not applicable in this case because the system is 
rapidly expanding, so that J > 0 (not only J > 0). Then 


- j= M [v 


GM\ n j . . r*1F 
— ) > 0 and M < . 


This view has been advocated by Ambartsumian [20]. The second 
explanation is that the clusters contain a great amount of inter- 
galactic matter (or the galaxies extend much further than the limits 
that can be reached spectroscopically); therefore the mass of the 
cluster is bigger than the mass of the observable parts of its galaxies, 
M > nm. 

A lengthy discussion of these views took place at the Santa 
Barbara Conference on the Instability of Systems of Galaxies in 
1961 [21]. No definite conclusion has been reached yet. 
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Chandrasekhar and Lebovitz [22], [23] have applied the tensor 
virial theorems in problems concerning the equilibrium of rotating 
fluids. 

A few applications of generalized virial theorems to nonspherical 
systems have been given by van Wijk [24] and King [25]. Further 
extension of this work should be useful. 

Limber [26], [27], [28] has given a form of virial theorem that 
takes account of the interstellar matter also. 

2. Poincare’s Recurrence Theorem. One of the most important 
theorems concerning the long range evolution of certain dynamical 
systems is the “recurrence theorem” of Poincare. This theorem has 
been given by Poincare in his Methodes nouvelles de la mecanique 
celeste [29, Chapter 26]. Its most accurate form, however, is due 
to Caratheodory [30]. 

Suppose that we have a continuous, measure-preserving flow in 
a space of N dimensions, defined by the equations 

Xi 0i(X()> 0 , 

where x 0 (x l0 , x^, * • • , x N0 ) gives the initial conditions for t = 0. We 
assume that 0(0 (x 0> *i) ,t 2 ) = </>(x 0 , t x + t 2 ), where 0 represents the set 
of functions 0 b 0 2 , ••*,0^. 

Also let there be an invariant set ft with finite measure M (i.e., 
the trajectories issuing from all the points of ft are contained inside 
ft). We can assimilate ft to a closed vessel inside which moves an 
incompressible fluid. 

We shall prove now that the trajectories issuing from almost any 
point x 0 of ft come an infinite number of times in the neighborhood 
of x 0 . More accurately, x 0 is a point of accumulation of the points 
(images) 0(x o , r), 0(x o , 2r) •••, where r is a definite positive or nega- 
tive number. “Almost any” means that the measure of the excep- 
tional points, where this property does not hold, is zero. 

Suppose that we have a set w of points x 0 of measure m, inside 
ft. The set of the points 0(x o , r), where x 0 £ w , has the same measure 
m. Let this set be written t). We shall prove that any set w 
of measure m > 0, has points in common with <t>{w,qT), for some 
value of q. 

In fact, if w and r), <j){w y 2t), • • •, 4>(w, qr) have no points in 
common, then their total measure is (q+l)m. This, however, is 
greater than M if q + 1 > M/m; therefore some sets of the above 
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sequence have a common part, e.g., ^(w.q^) and 4>(w,q 2 r). But we 
know that 0(</>(x o, (<72 ~ 9i) T ),<7i^) = <P(*o, < 72 t), i.e., to the above com- 
mon part there corresponds a common part between w and 
<t>(w,(q 2 - qd t). (See Figure 3.) 



Suppose now that A d is the set of the points x 0 all of whose 
images have a distance from x Q greater than d > 0. We shall prove 
that the measure of A d is zero. In fact, if m(A d ) > 0 we may separate 
A into a number of sets whose greatest dimension is smaller than 
d. Then the measure of at least one of these sets, w , is greater than 
zero. But we have proved that this set has points in common with 
(f>(w,qr) for some q. This means that for some points of w the 
distance from their images is smaller than d, contrary to our as- 
sumption. Hence m(A d ) = 0. This is true for each d; therefore the 
measure of the points whose images do not come arbitrarily near 
to them is zero. 

The theorem of Poincare does not mean that such exceptions 
do not occur. It means only that the probability of these excep- 
tions is zero, with the measure- theoretical definition of probability. 
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3. The Theorems of Hopf. If there is no invariant set SI of finite 
measure then we have a generalization of Poincare’s theorem due 
to Hopf [31]. 

Hopf calls “escape points” the points whose images <j>(x 0 , r), 
<t>(x 0 , 2r), • • • have no accumulation point in SI. Then according 
to the Weierstrass-Bolzano theorem these images will have a point 
of accumulation at the boundary of SI. On the other hand the 
points that are accumulation points of their images are called 
“recurrent points.” Then Hopf’s first theorem states that almost 
all the points of SI are either recurrent or escape points. It can be 
proved further that the escape points which do not go to infinity 
form a set of measure zero. Thus almost all points of SI are either 
recurrent, or they go to infinity. 

Of more importance is Hopfs second theorem. In order to 
state this theorem we distinguish between the two directions of 
time and call a point recurrent or escape point with respect to 
the past or the future. The second theorem of Hopf states that 
almost all the points that are escape (or recurrent) points with 
respect to the past are also escape (or recurrent) points with 
respect to the future. 

Let B be the set of the escape points with respect to the past 
inside any bounded set UiCZSl. B is the union of the sets B n , where 
B n is the set of points x 0i whose images (x 0 , — qr) are all outside 
U t for q > n. Then B n has no points in common with the sets 
<f>(B n , — qr),(f)(B n , — 2 qr), • and consequently any two such sets 
have no point in common. It follows that B n , </>(£„, qr), cf>(B n , 2<?t), ••• 
have no point in common either; for otherwise B n should have 
points in common with some set <f>{B ny — q'qr). Then it can be 
proved that almost all points of B n are escape points with respect 
to the future; in fact if we separate SI into bounded sets U n there 
cannot be a subset of points of B n of positive measure that have 
accumulation points in any U ly because then some images of B n 
should overlap, according to the first part of Poincare’s theorem 
and we have seen that this is impossible. 

We conclude that almost all the escape points with respect to 
the past are also escape points with respect to the future. Further, 
as almost all points are either recurrent or escape (in both directions 
of time), almost all recurrent points with respect to the past are 
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also recurrent points with respect to the future. 

Hopf s theorem does not exclude orbits of the form 4a that 
are neither recurrent nor escape, or 4b, that are escape points 
with respect to the future only (see Figure 4), but states that such 
orbits have a probability zero. 



a 

Figure 4. Exceptional Orbits 

4. Applications of Poincare’s and Hopf’s Theorems . Poincare’s 
theorem can be applied to the motion of particles in a stationary 
field only if there is an invariant set of finite measure. In general 
this happens if the potential energy V has a lower bound. The 
set 12 of points whose total energy is between two finite limits 
a < E < b is invariant, because the energy E is invariant along 
any trajectory of the representative point in phase space (i.e., 
the point that represents the positions and velocities of all the 
particles). If V has a lower bound V Bf then 

V B ^V=E~T<b-T^b\ 

hence 

T < - V B + 6, 

i.e., |x,|, \y ,\ , | r,| are all bounded. 

Further, V B g V <b, and if we take b < V„, then the space 
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coordinates that satisfy this inequality are bounded; hence Q 
is bounded. 

We must also state that in this case the flow in the phase 
space ( 2 ) r of the system is measure preserving. We have 


(36) 


dx( dxi 

W^ Xi ’ ~di 


and the expression 


dV 

dx i ’ 


(i = 1,2, 


Y f ds, 3y.~ dZj d /dV\ d /dV\ ) 

1 dXi dy, dZi dx, \ dXi ) dy, V dy t ) dz, \ dz; / ) 


is zero, because all the terms are zero. This is a special case of 
the following theorem: If in a system of N differential equations 
x = fix, t) we have the condition 


z 


n f 

— - = 0, then A = 
dx i 


D(x i, x 2> ---,x N ) 
D(x 10 , X 2 o, * * *, Xno) 


= 1, 


where x is a solution of these equations and x 0 the initial conditions. 
This theorem is easily proved if we remark that 


dA 

dt 


= aX o. 

dXi 


Then the measure Jhdx (in N dimensions) of a set of points 
that occupy initially a set of measure Jh 0 dx 0 is J^dx — Jl f{) Adx 0 
= JM 0 dx 0 , i.e., the flow is measure preserving (incompressible). 

In the case of the n-body problem we have a measure preserving 
flow; however, the potential V has no lower bound. In fact, near 
the gravitating bodies — co , and the corresponding velocities may 
increase beyond any bound; therefore in the n-body problem there 
seems to be no invariant set of finite measure. 

However there is at least one exception: this is the case of the 
plane restricted 3-body problem [32, p. 206-207]. Then E 
= (x‘t + x'l)/ 2 + V, where V = (r 2 / 2) - (/i/rj - ((1 - n)/r 2 ), (r, r u 
r 2 are the distances of the moving point from the origin and from 
the masses ^ and (1 — n) respectively). If r, r u r 2 are constant, 


( 2 ) We distinguish between the phase space of the molecule (/x-space, Boltzmann), 
that has 2 r dimensions, where r are the degrees of freedom of each molecule, and 
the phase space of the system (r-space, Gibbs) that has 2 rn dimensions. 
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V is constant. Then the inequalities a < E <b are written 2 (a — V) 

< xf + x\ < 2(6 — V); i.e., the point (x u x 2 ) is inside a circular 
annulus of measure w{[2(b — V)] — [2 (a — V)]) = 2x(6 — a), and 
this is independent of x u x 2 . Hence if the x u x 2 space is finite (this 
happens e.g., near each one of the two masses, inside the corre- 
sponding closed Hill’s curves, where V is smaller than a certain 
value c), then the invariant set $2 in the phase space has a finite 
measure; therefore Poincare’s theorem can be applied. 

However the corresponding measure of the iu x 2 , x 3 space in 3 
dimensions tends to infinity as V — > 

Some recent results of the Russian mathematician Arnol d [33] 
indicate that in most cases of the 3 or ft -body problem in 2 or 3 
dimensions there are boundaries in finite distance that the moving 
point cannot cross. Therefore in these cases Poincare’s theorem 
is applicable and the system is recurrent. 

In the most general case we can apply Hopf s first theorem and 
state that in the ra-body problem either the system is recurrent 
so that it returns an infinite number of times near its initial state, 
or the representative point goes to infinity. The exceptional cases 
of nonrecurrent systems whose corresponding points do not go to 
infinity, have a probability zero. Hopf s second theorem now states 
that the corresponding points of almost every system will go to 
infinity if and only if they have come from infinity. This means 
that if a star escapes from a system, then either this star or another 
one was captured by the system, or the system was initially oscil- 
lating in the opposite way to that shown in Figure 1. Similarly 
the capture of a star by a cluster is, in general, temporary, and the 
system ejects later one star (not necessarily the captured one), or 
it becomes oscillating in the above sense. 

A similar theorem was proved earlier by K. Schwarzschild [34] 
who found that the captures of comets by Jupiter are only 
temporary ( 3 ). 

Another consequence of Poincare’s and Hopf’s theorems is that 
a stellar system cannot tend towards a special final form unless it 

( 3 ) See also Chazy [13, p. 421AF. |. Chazy asserts that if a third body approaches 
a binary system from infinity it cannot form a stable triple system, or cause the 
disruption of the binary, or even be captured by one component of the binary 
while the other component escapes. However it seems that the last two possibilities 
exist. L. Becker [35], [36 1 calculated some orbits in the 3-body problem that 
resulted in the capture of a third body by a binary, while one of the components 
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is either stationary or it is disintegrating. E.g., a rotating system 
cannot become flatter, if only conservative forces are applied. In 
fact, the flattening of the stellar systems must be ascribed to the 
friction of the prestellar gas. We mention some other applications 
of Poincare’s theorem in §111.3. These examples show the usefulness 
of topological methods in some problems of stellar dynamics. 

5. Some Problems . Some unsolved problems should be mentioned 
in connection with the above theorems. 

(a) If for very small distances the law of attraction of two stars 
is changed so that V does not tend to infinity, then one might 
apply Poincare’s theorem and state that all stellar systems are 
recurrent. But then the stars should not approach beyond a certain 
limit; e.g., they should never collide physically. This shows the 
connection between the problems of escape and of very close 
approaches in a stellar system. It should be of the greatest interest 
to find under what conditions the distances of the stars of a given 
system never become smaller than a given amount. 

(b) As the criterion of Hopf cannot be verified in general (except 
in the case of recent captures), we need more practical criteria in 
order to determine whether a given system will eject stars or not. 

By applying the statistical methods of the third chapter we shall 
see that in general a stellar system will eject stars. However, as 
the dynamical phenomena are time reversible, this means that it 
is equally likely to have an ejection in the inverse direction of time, 
i.e., a capture. This is the main idea underlying Hopfs second 
theorem. But it seems that the phenomenon of escape is very rare, 
and one should not disregard the possibility that in some cases of 
isolated clusters it never happens. In fact the numerical calcula- 
tions of von Horner, that we shall mention later, show that escapes 
of stars are much rarer than it was generally believed until now. 

There are some reasons why the escape of stars is a very im- 
probable phenomenon. If the velocity of a star gradually increases 
because of close encounters, the star moves to the outer parts of 


of the binary escaped. Further O. Schmidt, Khilmi and other Russian authors 
proved, by means of numerical calculations also, that a third body can cause the 
disruption of a binary. If then we invert the time direction we find that 3 stars 
moving in special hyperbolic orbits can form a binary (see [14], [15], [37]). This 
phenomenon has a probability greater than zero. Similar results were found by 
Sibahara [38], [39], [40]. This work should be extended further. 
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the cluster where the encounters are much more rare, and therefore, 
as pointed out by Henon [41], [42], it cannot really escape. Only 
in the case of a very close encounter can a star acquire abruptly 
the escape velocity. 

If, however, a quasi-stable state is reached in a stellar system, 
it may be that such very close encounters never occur. The problem 
is worthy of further consideration. 

(c) It should be of interest to find in what cases the corresponding 
points of stellar systems are recurrent points that go also to infinity. 
The existence of such oscillating orbits has been proved lately by 
Sitnikov [43]. Sitnikov found a case of the 3-body problem where 
one body performs oscillations of greater and greater (unbounded) 
amplitudes, but it returns infinitely many times near the origin. 
Further, one should find under what circumstances two stars may 
oscillate alternatively, as in Figure 1. 

Some very interesting work in classifying the different types of 
evolution of a system of three bodies was done by Chazy [ 1 1 ], 
[12], [13]. A number of Russian authors have also worked in this 
field (see [14], [15]). Further work of this type in the n-body prob- 
lem should be very important. 

(d) A mean recurrence time can be calculated for recurrent sys- 
tems as follows: If M is the measure of an invariant set and m 
the measure of a sphere within which the point representing the 
system in the phase space must return, then the mean recurrence 
time is A/r/m, where r = d/v ( d is the diameter of the sphere and 
v the “velocity” of the moving point). 

Can such a mean recurrence time be defined in the case that the 
invariant measure M is infinite, and, if so, what is its order of 
magnitude? 

Consider for example two clusters, an open cluster with 100 stars 
and a globular cluster with 10 5 stars, and let their respective di- 
ameters be lOpc and lOOpc respectively. Their mean escape veloci- 
ties are approximately 0.4 km/sec and 4 km/sec. Suppose that all 
the stars are always contained inside the above limits of the clusters 
and their velocities are smaller than the corresponding escape 
velocities. Then the recurrence time needed for all the stars to 
return within 2 A.U. from their initial positions with velocities 
differing less than 0.1 km/sec from their initial values is greater 
than lO 1000 years in the first case and greater than io 1,000,1)00 years 
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in the second case. Some consequences of this fact will be mentioned 
in the third chapter. 

(e) It would be of the greatest importance to apply numerical 
methods to find some characteristics of a stellar system which would 
give indications concerning its evolution. Such methods were used 
by von Horner [9], [10] and we shall mention them presently. 

6. Numerical Calculations of the n-body Problem. Von Horner [9], 
[10] calculated numerically the evolution of some clusters with a 
small number of stars, up to 25. Unfortunately it is very difficult 
to calculate the evolution of a cluster with a great number of stars 
because the time needed for such calculations is prohibitively great. 
In fact the time needed by von Horner to calculate the evolution 
of a cluster for one time of relaxation is proportional to ft 4 . He 
spent 40 minutes (2/3 hours) for calculating the evolution of a 
cluster of 16 stars for one time of relaxation with the computer 
“Siemens 2002.” Therefore for 160 stars one should need 270 days 
with the same method and the same computer. 

However, even with a small number of stars, one can find many 
interesting results. 

The initial positions and velocities used by von Horner were a 
random distribution within a sphere in coordinate space and in 
velocity space correspondingly. Hence an initially constant density 
was assumed. 

After some time, equal to 2 or 3 relaxation times, a somewhat 
stable distribution of density and velocity was established but this 
distribution changed gradually afterwards. 

Although the first calculations (namely those in [9]) indicated 
a Maxwellian velocity distribution, it was found later (see [10]), 
when a greater number of stars was used, that this distribution was 
not even approximately Maxwellian. There were a greater number 
of large and small velocities than expected. The high-velocity stars 
were not escaping but formed close binaries near the center of the 
system. On the other hand the excess of small velocities corresponds 
to stars in the outer parts of the clusters. 

The number of escaping stars was almost 8 times smaller than 
expected. In general the stars going to the outer parts of the system 
were fewer than expected. This fact indicates that escapes are much 
more rare than usually assumed. 
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In the first calculations the density of the central parts of the 
cluster was represented by an isothermal gas sphere. However later 
(see [10] ) it was found that a central condensation is formed and 
the density follows there the law 1/r 2 . In the outer parts the density 
follows approximately the law 1/r 4 , as expected theoretically 
(see [44]). 

Chandrasekhar’s formula for the time of relaxation was verified. 
The virial theorem was also approximately satisfied, i.e., V/2 T ^ 1 
within a factor 2. 

These results show that the statistical theory of stellar systems 
is not yet sufficiently developed. Although at first almost all the 
main points of the theory seemed to be verified (the Maxwellian 
distribution of velocities, the formation of an isothermal core, etc.) 
it is now realized that many problems need reconsideration. 

There are many problems in this field that could be studied by 
means of numerical integrations. 

(a) It would be interesting to increase the number of stars in a 
cluster. Such calculations need of course the use of faster electronic 
computers. However, one improvement in the method of integration, 
which may give good results in a relatively short time is the use 
of different integration steps for different orbits, according to the 
closeness of each star with the others. 

(b) It would be very important to calculate the times of relaxa- 
tion for different kinds of orbits, namely radial orbits and circular 
orbits. Further, one should calculate the evolution of a cluster 
whose orbits are initially approximately radial, or circular, or 
specific mixtures of these two types. 

(c) One could study the distribution of the velocities at each point 
to find the change of the velocity ellipsoid. It should be useful to 
study also the distribution of the energy. Preliminary results (see 
[10]) indicate a distribution law [ — E]~ p with p variable in time. 
The distribution of the angular momentum is also important. 

(d) The study of the formation and dissolution of binaries is 
another important topic. Are these binaries similar to the usual 
binary stars? It seems that triple or quadruple stars can also be 
formed. 

(e) It would be interesting to study not only spherical, but el- 
liptical distributions of stars also. 

(f) Another relatively easy problem is the study of an expanding 
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system, like an association. The other extreme, that of a collapsing 
system could perhaps explain some phases of the evolution of a 
stellar system. 

Of special importance is the study of the final evolution of a 
stellar system, after most of its stars have escaped. It seems that 
this final state is a multiple or even a double star. This problem 
would need a very long calculating time. However, even the cal- 
culation with a very small number of stars, say 8 or even 4, would 
give useful results. 

(g) Another problem arises if we drop the usual assumption that 
all the stars have equal mass. The distribution of stars of different 
masses is also very important. 

(h) The evolution of a system whose stars change mass is still 
another problem. Thus one could explain the distribution of red 
giants, novae, and white dwarfs in clusters and in the Galaxy. Such 
a problem was considered lately by Boersma [46] and Blaauw [47]. 
Blaauw explains the big velocities of the so-called run-away stars 
by assuming that they belonged to binaries whose primaries lost 
mass very rapidly, so that the gravitational attraction suddenly 
diminished and the companions were ejected. 

(i) Another problem is the evolution of a stellar system that is 
not isolated, but is subject to the tidal force of the Galaxy or of 
passing-by clouds. As a good approximation one should take the 
tidal field as due to just a point mass. 

More problems of this type are considered by Ulam [45] and 
von Horner [9], [10]. 

These problems can be studied by using fast numerical computers. 
At the present time the analytical methods in the n-body problem 
seem to have exhausted their power. However, there is a revival 
of interest in the rc-body problem by using the results of numerical 
computations. These computations will give the necessary indica- 
tions for further analytical progress, as well as for further numerical 
experiments. The fact that some of the first numerical results by 
von Horner [9] have been already revised in [10] indicates that 
this field of research is still quite open. 

III. The continuum approach. 

1. The Distribution Function. The continuum approach considers 
a stellar system as composed of a very great number of particles 
distributed in a continuous way in space while their velocities form 
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also a continuous distribution in the velocity space. The points of 
the 6-dimensional phase space (the ^-space) are distributed as fol- 
lows: If dN is the number of stars with coordinates between 
and x x + dx u x 2 and x 2 + g?x 2 , x 3 and x 3 + dx 6 and velocities between 
Xi and x x + dx lf x 2 and x 2 + dx 2 , x 3 and x 3 + dx 3 , then 


(37) dN = f(x u x 2 , x 3 , x u x 2 , x 3 , t) dx l dx 2 dx 3 dx l dx 2 dx 3 , 
where / is the distribution function. We write 

(38) dN = f(x, x, t) dx dx. 

The equations of motion are 


(39) 


dx 


dx 


dV 


dt X ’ dt dx 9 


where V is the potential energy of a star per unit mass. 

We shall prove that / is an integral of the equations of motion. 
In §11.4 we have seen that 


(40) 


D(x,x) 
D(x o, x 0 ) 


On the other hand if we represent the number of stars in the six 
dimensional volume M 0 by SSm^ f(x o, 0) dx 0 dx {h this number 

remains the same during the evolution of the system; i.e., if x 
= x(x Q , x 0 , t), x=x(x 0 , x 0 ,t) is a solution of the equations of 
motion with initial conditions x 0 , x 0 for t = 0, then we have 


(41) 

But 



, x, t) dx dx 


M 



x 0 , 0) dx 0 dx 0 . 


D(x, x) 


j J f(x,x,t) dxdx = j j /(x(x 0 , x 0 , t),x(x 0 , x 0 , t) y t) dx 0 dx Q 


Mq 


ff 


f(x u , x 0 , 0) dx 0 dx 0 . 


Hence by (40) 

(42) J j [ f(x(x lh x 0 , t),x( x„, x„, t),t) - f(x lh x 0 , 0) ] dx ( ,dx ( , = 0 


PROBLEMS OF STELLAR DYNAMICS 


193 


for all regions Af 0 , i.e., f(x f x,t) =f(x 0 ,x 0 , 0) if (jc, jc) is a solution 
of equations (39). Therefore f is a first integral of the equations 
of motion. 

By differentiating / with respect to t we have 


(43) 


df * df df dV 

h X — Xi 

dt f^ldXi dXidXij 


= 0 . 


This is Liouville’s equation for a stellar system. 

The corresponding system to the partial differential equation 
(43) is 


(44) 


dxi 

dx 2 

_ dx 3 _ 

dxi 

dx 2 

dx 3 

Xi 

*2 

*3 

dV - 

dV ~ 

dV 




dx x 

dx 2 

6X3 


i.e., it is the same as the system (39). 

If I u 1 2 , are the 6 integrals of this system, the solution 

of (43) is / = /(/i, J 2 , ■ • Ig). This theorem is sometimes called 
Jeans’ theorem [48] but it was given earlier by Poincare (see 
[49], p. 101), and in any case it is a well-known theorem in the 
theory of differential equations. Two integrals of Liouville’s equa- 
tion are well known, namely the energy E and the angular mo- 
mentum C. 

If we consider only the distribution of the velocities in a region 
M v near the point (x l ,x 2 >x 3 ), we may write 


(45) dN = Fdx , 

where 


(46) 



In order to describe the observed distribution of stellar velocities 
near the sun, K. Schwarzschild has given F in the form 

(47) F = A exp( — K 2 R 2 — H 2 (0 2 + Z 2 )), 

where A = NKH 2 /i r 3/2 , N is the number of stars in the region M 0J 
K and H are functions of the position only, while R is the radial 
velocity directed from the center outwards, 0 is perpendicular to 
it on the galactic plane, and Z is perpendicular to the galactic plane. 
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Generalizations of this distribution are due to Charlier, Eddington, 
Jeans, Oort and others. 

The most general ellipsoidal law is given by Chandrasekhar [50], 
[51], [2]. He takes 

3 3 

(48) /= A<t> ( X X a y(*i - x i0 )(Xj - x j0 ) + o , 

\ i=i >=i / 

where a tJ ( = a, t ), x 10 , * 20 , * 3 u, o and V are 11 unknown functions of 
the position and time only, such as to satisfy Liouville’s equation. 
The velocity of the local standard of rest, i.e., the mean absolute 
velocity of the stars of the neighborhood of the point (x u x 2 , x 3 ) 
is (x 10 , x 2 o, ^ 30 ) • 

If this formula is introduced into Liouville’s equation, we find a 
system of 20 differential equations for the 11 unknown functions 
above. Chandrasekhar has solved completely the corresponding two- 
dimensional problem and many special cases of the three-dimensional 
problem also. He proved that a finite system in a steady state 
must necessarily have axial symmetry. The most general solution, 
however, has not been given. 

Chandrasekhar studied in more detail the case when the velocity 
ellipsoid is reduced to a sphere: 

(49) f = A<}>(a((x x — x lu ) “ + (x 2 — ^20) 2 (*3 ~ *;k>) 2 ) + • 

The solutions include many spiral forms of the density distribution. 
The same spiral solutions are given by Schiirer [52], by a much 
simpler method, namely by a change of coordinates. It was thought 
for some time that these solutions could explain the spiral form of 
the galactic system. However the spherical distribution of the 
residual velocities is not realized in the actual Galaxy. Further it 
must be stressed that even the general ellipsoidal distribution of 
the velocities is only an approximation. In fact <t> should have not 
only quadratic terms in the residual velocities, but third order terms 
also etc. These third and higher terms are small if the velocities 
are small, and for this reason they are omitted in a first approxi- 
mation. 

Another limitation of Chandrasekhar’s analysis is that it does 
not deal with self-gravitating stellar systems. 

In a self-gravitating stellar system we have also Poisson’s equation 
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( 50 ) 

where 


T7 d 2 V d 2 V d 2 V A „ 

A V = ^~2 +372+^72 =4*Gp, 


dxi dxi 


dxi 


(51) P = j Fdx 

is the particle density. If stars of unequal masses are considered, 
then F is replaced( 4 ) by f mFdm\ p is then the mass density. 

By applying Poisson’s equation one finds that not all the solutions 
of Liouville’s equation represent self-gravitating stellar systems. 
E.g., Kurth [55], [56] proved that in the special case discussed by 
Schlirer the system either is in a steady state or it is homogeneous. 
This led to the assumption that all ellipsoidal solutions of Liouville’s 
equation are either in a steady state or homogeneous. This has 
caused a decrease of interest in these problems in the last years. 
However the problem is not yet settled completely. First there are 
some solutions that cannot be given by Schiirer’s transformation. 
Further the general ellipsoidal solution of Liouville’s equation is 
not given. It would be useful to complete the discussion and see 
to what extent Kurth’s theorem can be applied. 

At any case there always remains the main problem of finding 
solutions that satisfy both Liouville’s and Poisson’s equations, 
especially for systems in a steady state. Prendergast [57] has given 
the general solution in the one-dimensional case. Other special solu- 
tions are given by Camm [53], [58] for stratified and spherical 
systems, by Kurth [59] for spherical systems, etc. 

In the case of spherical stellar systems Poisson’s equation becomes 


(52) 


d 2 V 2 dV _ 1 d / 2 dV\ 
Hr 1 + r~dj r ~Wdr\ r ~dr) 


= 4 irGp. 


Hence 


( 53 ) 


dv r r 

r 2 -^— = 4 tG I pr 2 dr = Gm(r), 
dr Jo 


where m(r) is the mass included in a sphere of radius r. 
If the mass of a system is finite, then 


( 4 ) Cf. G. L. Camm [53], A similar method was used earlier by J. Jeans [54, p. 231]. 
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(54) 


d V Gm (r) Gm _ 

dT = _ ~~P~ 


and 


i.e., 

(56) V„ — + V(r 0 ). 

r 0 

Therefore, since V is an increasing function of r, it has a finite 
upper bound. 

If F = F(E f C) is a function of the two known integrals of 
Liouville’s equation, then Kurth proved that for E ^ V«,, F(E, C) — 0. 
Camm has considered the case where 


(57) 


F = 


(A(2E l - 2 E)\ if E^E V 


i 


0, 


if E > E i, 


and fi > 0. 

This is consistent with the above restriction if E { ^ V m . Then 


FdRdodZ 


(58) 


-/I/*- 

= 2* A JJ (-2V+2 E,- R--SYdRSdS, 


where S= (O 2 +Z 2 ) 12 is the transverse velocity, and the double 
integral extends to all the values where S > 0 and 

(59) R' z + S 2 ^2E l -2V. 

If we write 2 E x — 2 V = if we find 


p = 2 

2irA 




' ( (n 2 - R 2 ) - s*y - dS 2 

- f U (i2 2 — R*)'+ l dR, 

1 J» 


( 60 ) 
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and if we set R = st\/ 1 


(61) 


P = -+V a 2 ^ 3 f (i - ty +l r l 2 dt 
P + 1 Jo 

= -?A-(2E l -2V)» + ^B („ + 2,|) 


where 


B (m + 2,- 


4 ) 


r(i» + 2) r 


i) 


’( M + 1) 


whence finally 

(62) 


7r 3/2 A r( M + 1) 


(2£ 1 -2V)'' 


r U + 


If we put this value in Poisson’s equation (52), setting 
(63) $1= — 2V + 2Ei = ( — 2 y 0 + 2E l ) \p and r = cx 

with 


(64) c 2 = r 
we get 

(65) 


p + 1) (ro. + linilAO-V 1 ^ - 2 Vq) " 1,2 » 


af's) 


This is the Emden equation of a poly tropic gas sphere (°) with 
index n = p + 3/2. The initial conditions are \p = 1 and dyp/dx = 0 
for x = 0 (because dV/dr = 0 for r = 0). 

Emden [60 ] proved that for n < 5 we have finite radius and mass. 
For n > 5 the radius and mass are infinite. For n = 5 we have the 
solution 


(°) A polytropic gas sphere is a sphere of gas whose pressure is proportional 
to the 7 power of the density; p y . In Equation (65) we have 7 = 1 + l/n 
and p = yp n . 
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(66) 


(3q) 1/2 

* (3 + q*x 2 ) 1/2 ’ 


and 


(67) 



B 

(a 2 + r 2 ) 5/2 ’ 


with 

2 3c 2 , jj / 3c 2 \ 5/2 

(68) a 2 = -^r, and B = j . 

Then the total mass M = 4irB/3a 2 is finite but the radius is in- 
finite. This is the law of Schuster, applied by Plummer in the 
globular clusters. (See applications of poly tropes [61 ], [62], [63].) 

Another important case is that of an isothermal gas sphere. In 
this case we have a Boltzmann distribution of energy 


(69) F = A exp( — 2a 2 E), 

and correspondingly a Maxwellian distribution of velocities. Then 


p = 


exp( - a 2 (ji jf + + x| + 2 V)) ■ di^dii 


(70) 


A n- 3 ' 2 

— 3— exp(— 2a 2 V), 


or 

(71) p = p 0 exp( - 2a 2 ( V - V 0 )) 


with 

4.r 3/2 

(72) po= exp(- 2a 2 V 0 )- 

Then the equation of Poisson gives 


(73) 


Id/ 2 du\ 

Y l dx V Tx) 


= exp( — u), 


with 

(74) 2a' z (V-V 0 ) = u, r=cx, c 2 = (8ira 2 G#><>) '*• 
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The initial conditions for x = 0 are u — 0 and du/df = 0. 

This is the equation of an isothermal gas sphere. This model 
gives an infinite radius and mass, as can be derived, e.g., from the 
fact that F does not become zero for any value of E . However it 
is assumed that this model represents approximately the central 
parts of stellar clusters. 

It is proved that if the mass of a system is finite, then the condi- 
tion F(E, C) = 0 for E ^ V „ is necessary. This condition, however, 
is not sufficient. E.g., if we have a polytropic sphere with an index 
n > 5 the mass is infinite, although the condition is satisfied. 

However Kurth proves [59] that in these cases there exists a 
function F(E, C) arbitrarily close to a given function F(E, C) for 
which the corresponding mass and radius are finite. 

Axially symmetrical stellar systems that are consistent with 
Poisson’s equation have been considered by Camm [64], Kurth 
[55], Fricke [65], [66] and lately by Lynden-Bell [67]. 

It is an open question whether there exist systems where the 
distribution of velocities is everywhere ellipsoidal. Camm and 
Fricke proved under certain assumptions that no such system 
exists, while Kurth gave a method to obtain such systems. All 
these discussions are not yet complete [68]. Camm assumed that 
if a system has a finite mass then the potential has an asymptotic 
expansion 


(75) 




for large r (where c > 1), and in this case he proved that the density 
must be zero. However if 



for large r we cannot have an asymptotic expansion of the form 
(75) because 

(tab)/^) to"---- 

But the system (76), although it extends to infinity, has a finite 
mass, because Poisson’s equation gives 
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(77) 


1 d ( ,d ( GM_( 
4ir Gr* dr\ dr \ r \ 

{(logr) 2 + (logr) 2 ]’ 


logr 


))! 


MS 

i^r 21 


m„ — m(r 0 ) = MS 


(78) 


J 


dr 


= MS 


r(logr) 2 

+ 


+ 


2dr 


r (logr) 


t log r o (logr„) 2 ] 

Therefore there may be cases that are not covered by Camm s 
proof. On the other hand Kurth’s solutions do not give a finite 
mass; in fact Kurth gives the potential function by solving formally 
Poisson’s equation, written in the form 

(79) A V — 4 t vG P ( V; x l9 x 2 , x 3 ) , 

where 


(80) 


p = 



F(E , * * •) dxidx 2 dx A ; 


here E = (x'{ + x'i + xj)/2 + V , and the dots after E represent the 
other integrals of motion. Then we can write 

, „ c r r p(v(^^Q;^^Qd^ 2 d^ 

(81) V(x u x 2 ,xJ = G J J J \x~~T\ ’ 

Kurth finds V either as a series in G, or l/\/ G, or by an iteration 
method, setting V 0 = 0 and computing after V n is found, by 

the equation 


(82) 



p( V n ; £i, £n) 

i*-ir 


d£xd£ 2 d£:\. 


Then 

(83) V = lim V n . 

n -+ 

Let us consider the last method and assume that there are only 
two isolating integrals E and C. 
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In a system of finite mass we may take V w = 0; thus 
p(0;£i, £ 2 »£a) = 0 for all the values of C, because F(E,C) = 0 
for E = (xf+ x£ + xf)/2 > V» = 0. Therefore Vi = 0 and generally 
V n = 0 and V = 0, i.e., this method does not give V. 

The same is the case with the series expansions of the form 

(84) V — V 0 (x u X 2 ,x^ + G Vi(xi, x 2 , x 3 ) 
and 

(85) p = p 0 (xi, X 2 , X3) + Gpi(x u x 2 , x 3 ) + * • *, 

because again for V 0 = 0 we have p = p 0 = 0 and then Equation 
(81) gives, step by step 

V x = V 2 = • . . = 0. 

Therefore Kurth’s method does not work for systems with finite 
mass. 

Danby proved that in the case of an ellipsoidal distribution of 
velocities there is no third integral besides the energy and the angu- 
lar momentum. 

If we drop the ellipsoidal hypothesis we can find axially sym- 
metrical systems whose distribution function F is a function of 
E and C. In fact Lynden-Bell [67] solves the equation 

p = Fdx 

for axially symmetric systems when p is given, and finds F — F(E , C) . 
There is a special solution that is even with respect to C; i.e., 
F=F(E,C 2 ). 

The general solution contains also any odd function A F with 
respect to C because 

(86) f A Fdx = 0. 

AFis found by Lynden-Bell by means of relaxation considerations, 
assuming that the stellar encounters cause the velocity dispersions 
along the 3 axes to be equal. This, however, is not the case in our 
Galaxy. (See also [236].) 

The distribution function of a spherical system is not necessarily 
a function of C only, as was assumed by Jeans [69, p. 365]. The 
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general form of F is 

(87) F = F(E, Ci, C 2 , C 3 ), 

where C b C 2 , C 3 are the three components of the total angular 
momentum. 

Lynden-Bell [ 70 ] and Woolley [71 ] considered a case where a 
spherical cluster is rotating. Such a rotating cluster can be con- 
structed out of a nonrotating one by inverting the velocities of 
the stars moving, say, clockwise with respect to a given axis. 

More generally, we may invert the different velocities of different 
groups of stars rotating around different axes, without changing the 
spherical form of the cluster. 

Fricke [65], [66] has made a severe criticism of the ellipsoidal 
theory of stellar velocities. He points out that the ellipsoidal distri- 
bution is only a rough approximation to the real distribution of 
stellar velocities, and then only if these velocities are small. It 
is well known that the high velocity stars have a very asymmetric 
distribution and therefore their motions can by no means be de- 
scribed by the ellipsoidal theory. 

Fricke made a search for more general distribution functions F 
of the form F{E, C) that are not quadratic in the velocities, but 
satisfy Poisson’s equation. He considered integrals of the form 

F = Z“*E l C k 

i,k 

(polynomials in E and C) for systems with finite mass. The solu- 
tions found are of the form 

( 88 ) E*‘- 2 XAkiE k ' l C 2k + EW^BuE'+'C** 1 . 

Here N is a nonnegative integer and in both summations k goes 
from 0 to n and l goes from 0 to m. Fricke gives the examples 

(89) F = (2E) ba (2 + 100(2E) 4 C 6 - 49(2£) a C 5 (l + (2 £) 2 C 2 )), 
and 

(90) F= (2£) 5/2 (0.5 + 100(2E) 11 C 2U + 49.5(2£) lu C ly (l + (2 E) 2 C 2 ))\ 

these two distributions of velocities can describe both the almost 
ellipsoidal distribution of small velocities and the big asymmetric 
velocities of the high velocity stars. 

However the functions used by Fricke seem rather artificial. No 
obvious reason suggests the use of any of the above functions F. 
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Further, the fact that the velocity ellipsoid is flattened along the 
2 -axis cannot be explained if only two integrals of motion E and 
C are used in the distribution function F . 

2. Integrals of Motion. The equations of motion in an axially 
symmetrical potential field are 


(91) 


dr 

¥ 


dz 

~Z 


dR 


de 


dZ 


dV_ o 2 _ Re _dV_ 
dr r r dz 


= dt, 


and these have two well-known integrals: the energy 

E=(R 2 +e 2 + Z 2 )/2 + V, 

and the angular momentum C = r0. 

The general form of an integral that is a function of E and C 
and is quadratic in the velocities, is 

(92) I =2E- 2 k x C + k 2 C\ 

or 


1= R 2 +Z 2 + 0 2 (1 + k 2 r 2 ) - 2 k,re + 2V 


(93) 

where 

(94) 


- R 2 + Z 2 + (1 + k 2 r 2 )0 2 -\- 2V - 


ky 


1 + k 2 r 2 


0 = 0 — 0 O and 0 O = 


kir 


1 + k 2 r 4 


( 6 ). 


( ) Then the constants of Oort are 
A = 


and 


1 / Oo _ dQ o\ 

2 \ r dr) 

fi= _i/e ! , + dGo\ _ 
2 \ r dr } 


, = *=a-b = 


kik 2 r 

(i + vV’ 


a + v 2 ) 2 ’ 




r 1 -h k 2 r 

and the square of the ratio of the axes b (along 0) and a (along R ) is 

1 

a~ 1 + k 2 r l 


b A 

~ 


_ A 
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If k 2 — 0 we have three equal axes. This happens when statisti- 
cal equilibrium is reached. In this case 0 O = k\r, i.e., the system 
rotates like a rigid body. 

The velocity ellipsoid has the axes R and Z equal. However 
in K. Schwarzschild’s ellipsoidal distribution defined by (47) the 
axes 0 and Z are equal, and much smaller than R. In a more 
general form 

/ R 2 O 2 Z 2 \ 

(95) F = A exp ( ^ “ — t ) ’ 

\ <J\ <r 2 03 / 

the values of the dispersions a 2 , 03 are given for different groups 
of stars by Nordstom [72]. In general it is <r x > <r 2 > but some- 
times <xi > a 3 > <r 2 . In any case the axis R is always greater than 
the other axes. 

The observed inequality of the R and Z axes cannot be explained 
if only two integrals of the equations of motion are used. 

As we have seen, Liouville’s equation has 6 integrals of motion 
that are, in general, time dependent. By elimination of the 
time we get 5 time independent integrals. These, however, are not 
in general isolating (Wintner [73, p. 93-97]; Contopoulos [74]); 
i.e., if they are solved for one variable as a function of the others 
they give an infinity of solutions that are densely distributed in 
some interval. E.g., in the case of two perpendicular oscillations 
we have the system 


(96) 


dx dy _ dX __ dY 

X ~ Y ~ - Ax ~ - By 


(X, Y are the components of the velocity), with the integrals 


(97) 4> 10 =i(X‘+^), 4-20 =^(Y 1 +By 2 ). 


If A ia /B 1 ' 2 is irrational, the third time-independent integral 


(98) 


^1 . _j B l,i y 

7o-gi7 2 sm 1/2 


l 1/2* 


i 1/2 


sin 


(240 1/2 ’ 


is not isolating. 

In fact we have 

B^y 

(24>2o) 1,2 


/ B m 

sin yB xa T 0 + -^-rj^sin 


A ia x \ 

(2-i>,o) 1/ V ’ 


(99) 
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and for a given value of x we have infinitely many values 

sin ''<s^ -!i+2K * ° r sin "(s^- (2K+i)r “ s " 

Thus 


1/2 

sin ( fi 1 2 Tq+ - ($i + 2Kn) 


A 

= sin ( B ir2 


To- 


—) 

A 1 ' 2 ) 


+ 


2KtB 112 


1 it 


2Ltt 


) ’ 


where K , L are arbitrary integers; 2tt(K(B 1/2 /A 1/2 ) — L) can be 
made to approach any number arbitrarily closely, by a suitable 
choice of the integers K , L. Therefore the values of B i/2 y/{ 2<J> 2 o) 1/2 
are densely distributed in the interval ( — 1,1) and this means that 
the integral is not isolating. If A i/2 /B l/ 2 is rational, we have only 
a finite number of values of y for any value of x and the integral 
T 0 is then isolating. 

Many authors have used isolating integrals besides the energy 
and the angular momentum in some special cases. E.g., B. Lindblad 
[75], introduced the integral 

(100) / 3 = Z 2 + 2(V- Vq), 

where V 0 =V(r,0). In this case, however (d 2 V/drdz) = 0; this 
integral is approximately valid for very flattened systems. Van 
Albada [76] and Kuzmin [77], [78], [79] have studied especially 
the integral 

(101) I 3 = (rZ - zR) 2 + z 2 e 2 +c%(Z 2 +2V*), 

where V* is connected by two relations to the potential energy V , 
and c is a constant. This integral was introduced for the first time 
by Eddington [80]. In this case V is of the form 


(102) 

where 


£i — Vi 


(103) 


= r 2 + z 2 ± (c 4 - 2c V - r 2 ) + (r 2 + z 2 ) 2 ) 1 ' 2 . 
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A general case where an isolating third integral can be found 
has been given by Stackel [81], [82]. In the two-dimensional case 




Af(f) + N(n)- 


^ L K(f) + L(„) J 

while the line element is 

(105) ds 2 = (K(0 + L( v ))(dt i + dr, i ). 

In this case the corresponding momenta to the variables £, y are 

(106) p ( = (K(Z) + L(r,))k, p„= (K(S) + L(i»))n, 

hence the energy integral is 

(107 ) H P±±Pl 1 ,0 

UU/) 2LX(t)+L(,)J L K(lt) + Lit,) J 

Then the Hamilton-Jacobi equation is separable, and two integrals 
can be easily found 

(*(£) + L(„)) 2 *7 2 - M(£) - aK(& = - 6, 

(108) 

(* (£) + L(„))V/2 - NW - aL(„) = 6. 

By adding these two integrals we derive the energy integral (107). 
Stackel considered also systems of more than two dimensions, where 
we have separation of variables in the Hamilton-Jacobi equation. 

Weinacht [83] proved that if the line element is Euclidean the 
most general cases where we have separation of variables in 2 or 
3 dimensions which can be reduced to Cartesian coordinates by 
a point transformation, are of the Stackel type. 

Van de Hulst [84] used the transformation 

r = c sinh £ cos t? — k + r 0 , 

(109) 

z — c cosh £ sin 7j, 
and a potential of the form (104) with 

#(£) = 2c 2 cosh 2 £, L(rj) = — 2c 2 sin 2 ij, 

M(£) = — 2c 2 cosh 2 £Vo, N(r}) = — Q(c 2 + fc 2 )2sin 2 ?7, 


where c, k, r 0 , Q are constants, and V Q is the potential for rj = 0. 
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This is a generalization of Eddington’s potential, which cor- 
responds to k = r 0 . It is easily proved that if k = r 0 

( 110 j c 2 (2cosh 2 £- 1) = (or = vi), 

c 2 ( 2 sin 2 jj - 1) = in (or = £i). 

Van de Hulst’s transformation (109) is the most general point- 
transformation in two coordinates that gives separation of variables. 
More general transformations that give separation of variables one 
can find only by using contact transformations (Siegel [32], 
Contopoulos [74], [85]). 

A systematic search and classification of potentials with isolating 
integrals has been done lately by Lynden-Bell [86], [87]. Lynden- 
Bell considers only integrals depending on an arbitrary function 
(corresponding to potentials that also depend on the same arbitrary 
function), and he calls such integrals “local.” 

In general, however, a dynamical problem is nonseparable, or 
more accurately, there is no known transformation that can reduce 
it to a separable one. 

In such cases a third integral of motion can be introduced in the 
form of a series. Whittaker [88] and [89, Chapter 16], called this 
integral the “adelphic integral” (from the greek word “adelphos,” 
meaning brother) because it is very similar to the energy integral. 
This is given in the form of a trigonometric series. Another form 
of this integral was given by Cherry [90], [91], [92]. The third 
integral in a form directly applicable to stellar systems was given 
by Contopoulos [93]. 

If we have a Hamiltonian system of the form H = H z + H 3 
+ H a + * * * , where H k is an homogeneous polynomial of degree k 
in the variables, we can make a linear canonical transformation of 
variables so that H 2 becomes 

n 

= £ Kx.y,. 

i = 1 

Then there is a formal canonical transformation of variables 


(111) 


x, = F(x t ,y,) = jc, + 

G(Xi,yi) = y, + •••, 


such that H becomes a function of the products a, = x l y l only. 
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The canonical equations become 


( 112 ) 


dxi _ dH _ — dyi _ 

~dr~w,~ m,Xl ’ ~dr~ 


— =r = — rriiy \ (i = 1,2, ***,n), 

dXi 


where 

(113) 


/n t = 


d JL 

dVi ' 


Then 


-dx t dy, 

y ‘jr +x ‘dr - 0 


or 



Hence we have n integrals 


(114) Vi = x.-yj = const. 

Then rrii are constants, and therefore the solution of the Hamiltonian 
system is 

(115) Xi = a, exp(m,0, y t = 6,exp(m,0. 

The same transformation was introduced by Birkhoff [94, Chapters 
3, 8]. His method is similar to the methods of Delaunay [95, p. 541] 
and von Zeipel [96] in classical Celestial Mechanics (see also [74]). 

We shall presently apply the von Zeipel method in the case of 
two dimensions. 

If 

(116) H = H 2 + Hs + • • •, 
where 

(117) H 2 = + ^2 X 2yiy 


we introduce the determining function 

(118) S = S(x l ,x*y u yJ=S 0 +S l + •••, 
where 

(119) S 0 = x x y x + x 2 y 2 , 

such that the new Hamiltonian H is a function of v x = x Y y x and 
v 2 = x 2 y 2 only. We have 
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(120) 

Hence 


as _ as, 
y, — t — = Vi + -t — h 

dx l dXi 


as, 

i “1“ . — “1“ 

dy, 


as, as 2 


uu, uu, -r— » ( a 2 S, / 

X ‘ = X ‘~ ~Z= “ "aTf ' I 3,7^ ( _ 


ay, ay, ^ I ay, ax, 


as, 

ay) 


+ 


(121) 


_a^/as, \ j _ 

ay)dy, \ dxj ) ) + 


_ as, as 2 2 ( a 2 s, / as, 

= v, -i a — i a- Y < 1 


yi = y, + -r= + ~r= + L j 7“-^-,— 
ax, dXi ,_, i ax, ax, 


ay, 


= + 


+ 


4S = (^ + ...)l +... 

dXidyj \ dXj J ) 


ax, ay, 

where the derivatives are calculated for x) and y) . Then 


tf(x„y,) = H (x)-^ 

ay, 


_ as, 

• , y, + — — + 
5x,- 


= H _y a^as, 2 a# 2 as, 

2 ,=i ax) ay) i= , ay) ax) 


, H _ y dHs_dSi * djh as, 

3 ,=i af) ay) , =1 a>^ ax) 

d22) _y ^[^2 (^_as, __^s L as,j 1 

fr, ax) L ay) ( ayidx} ay) ay)ay) ax) j J 

j, djh r as 2 _ 2 ( a 2 s, as, a 2 s, as,| 1 

ay) L ax) ,_, (ax)ax) ay) ax)ay) ax) I J 

v y i f a 2 H 2 as, as, a 2 H 2 as, as, 

+ nr; h 2 ( ai)ax) ay) ay) ax)ay) ay) ax) 

a 2 # as, as 2 ) 

+ ay)ay) ax) ax) 1 

+ H 4 = H-i + + H 4 + • • • ; 

the derivatives of H 2 ,H 3 ,--- are also calculated for x), y). E.g., 
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(123) 

Hence 

(124) 

(125) 
and generally 

(126) 


dH 2 - dH 2 - 

a — Kyi, a — — A,x„ etc. 
dx t dy ,■ 


H 2 =H 2 , 

^ ^-dS l * _dS, „ Q 

= Ha - H 3 = 

iTi d*, frl 5yi 


- 2.X t y,-^r = if*- V*- **• 
d*« -Ti dy, 


where is found after the terms S 1? are defined. If we 

write Q k = H k — Q' k , and take H k to be the sum of the terms of 
the form (xiyd ai (* 2 5^)° 2 i n QL then H* is a function of and v 2 
only, and Q k contains only terms of the form x* 1 y* 1 x% 2 y£ 2 in which 
(«i - b\) 2 + (fla — ^ 2 ) 2 7 ^ 0 . 

The corresponding system to the Equation (126) is 


(127) 


dx x _ dx 2 __ dyi _ _ dy 2 _ jS* 

x^i X 2 x 2 x^! x 2 y 2 Q* 

and it has the integrals 


= x x y u v 2 = x 2 y 2 and ^ = c. 

x 2 


(128) 

Therefore S k is given by 

s.- f 

J Ai*i 


(129) 


ctx 

- — — T Sk o(^b C ) > 


where in Q* we have replaced y 1} y 2 , by vjx u u 2 /x 2 and x 2 by 
xJ 2/X1 c -a 2 ; s k0 i s a n arbitrary function of v lt v 2 ,c. 

Let x? v y b i l x 2 *y 2 2 be a term of Q k . This is written 


v\ l v$x\ > x \c~ k * Ka * b2) 


where 


X = X 1 (ci 1 — 61) + X 2 (a 2 — 6 2 ) • 


After integration it gives 
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vt 1 vfrx^c-^-Vx- 1 , 

which can be rewritten as 


(130) 


^l( a l ~ bi) + X 2 ((Z 2 ~ ^2) 


The denominator (Xi(ai — b x ) + X 2 (a 2 — b 2 )) is never zero if 
X 1 /X 2 is irrational, because (a x — 6 X ) 2 + (a 2 — 6 2 ) 2 ^ 0. 

Thus S and H are constructed step by step; both are functions 
of x, and y t , and H = f(v u v^. Then the transformations (121) 
give Xi , yi as functions of x if y^. The problem of the convergence 
of S and H , however, is left open. 

If Xi/X 2 is rational, one can find S by a convenient choice of the 
arbitrary functions S k -j , 0 (0 < j), so that there should not appear 
terms of the form 

Xi l yi l x 2 2 y 2 2 in Q k with A^a, - 6,) + \ 2 (a 2 ~ b 2 ) = 0. 

The elimination of such (secular) terms has been discussed by 
Whittaker [88], [89], Cherry [91], and Contopoulos [237]. 

In the case of a stellar system with an axis and a plane of sym- 
metry we can develop the potential function in a region around 
a point (r 0 , z 0 = 0) in the plane of symmetry in the form of a series 

(131) 2V= - ( C 2 /r 2 - P? - Qz 2 + 2a^/3 + 2b£z 2 -+ ), 

where £ = r — r 0 ; C is the constant of areas and P, Q, a, b are 
constants. 

In the case of our Galaxy one can take approximately P 
= 0.076 (10 7 years)” 2 , Q = 0.550 (10 7 years)' 2 , a = 0.052 (10 7 
years) -2 kpc 1 , 6 = 0.206 (10 7 years) 2 kpc _i . 

A third integral of motion can now be found as a formal series 
(see [93]): 

(132) $ = <t> 0 + a$ a + b$b + • * ■ + a 2 $aa + Qb^ab + b 2 $bb + * • • 
where 

*0 = \(P^+R 2 ), 

(133) 

*» = * - p - ((P - 2 Q )& - 2 + 2 RzZ), 

— r 



212 GEORGE CONTOPOULOS 

etc. The accuracy of this integral is very great. In two orbits cal- 
culated for 5 X 10 y years, the accuracy is better than 1/1000 if 
terms up to the third degree are included (see [97]). 

The existence of an isolating or quasi-isolating third integral can 
explain the three-axial form of the velocity ellipsoid. In fact, we 
may take approximately 

(134) 

+ 7?A”B(( p - 2Q)tz*-2tZ* + 2RzZ), 

4q> -- H 

and this is quadratic in the velocities. Then 
(135) F=A0(/) 


where 


/ = 2E - 2 k { C + k 2 C 2 + = /?“(! + k 3 ) + (1 + k 2 r 2 )(B - O 0 ) 2 


(136) +Z*( 1- 

+ k.t ( P{j~ 


4 k :i b£ 

4 Q - P 

2a^ 


, R7 

+ 4Q-P 


ky 

TTky 


2b 


3 4 Q - P 


(p-2Q)y 


This represents a three-axial ellipsoid whose main axis forms a 
small angle (proportional to z) with the radial direction R. This 
angle is exactly zero if 2 = 0. 

Barbanis [97] gives for the velocities R and Z the distribution 

(137) dN = A { e 2kE 2L *dRdZ , 

and he finds k — 19.7 (kpc/10' years) 2 , l — 14.1 (kpc/10' years) 
The axes Z and R in the meridian plane have a ratio 0.5, as given 
by observation (see [98]). 

3. Galactic Orbits. The orbits of the stars in a stellar system are 
in general calculated without taking into account the effects of the 
encounters; i.e., a star is supposed to move in the general potential 
field of the stellar system, considered as a continuum, and any 
deviations from this motion, because of close encounters, are dis- 
regarded. This is because the time of relaxation is very great in 
comparison with the mean periods of the motions of the stars. E.g., 
in the Galaxy the time of relaxation is of the order of 10 14 
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years (see [2]), if only the interactions of stars are considered, 
while the period of galactic rotation is about 2 X 10* years. This 
means that a star may, under these conditions, describe 10° revolu- 
tions around the center of the Galaxy before its orbit is markedly 
changed! '). 

In a globular cluster the time of relaxation is about 10 9 to 10 lu 
years (see [2]), while the rotation periods are of the order of 10' 
years or smaller. Again the stars may describe many revolutions 
before their orbits are markedly changed. 

a. Plane Orbits . The orbits of stars in a globular cluster or in 
the plane of symmetry of a galaxy have been widely studied. 

In the case of a globular cluster (see [99], [100], [101], [102]) 
the potential energy per unit mass is 

(138) V = - Gm ^ - 4 t tG j rp{r)dr , 

where m(r) is the mass of the cluster inside a sphere of radius 
r, p is the density and R is the radius of the cluster (that may be 
considered infinite). The equations of motion are now 

1 do 

(139) - (v 2 - vl) = V(r„) - V(r) and r 2 ^- = C. 

z at 

If we set 


(140) 

w(r) = — 2Gj 

f"”*. 

lo r 

and 



(141) 

1 

II 

w(r 0 ), 

we get 



(142) 

v 2 = w(r) + h = 

, dr \ 2 C 2 

u +?• 

Thus we find 



(143) 

/dr\ 2 (w{r) + h) 

\dt) r 2 

r l - C 2 <j>{r) 

r 2 


( ) We shall see that other effects, besides the interactions of stars, make the 
time of relaxation much shorter. However, it is longer than one rotation period. 
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hence 

f r rdr 

(144) 

to J (</>(r)) 1/2 ' 

Similarly 

Cdr 

(145) 

6u Jro r(<t>(r)) 1 ' 2 ’ 


where (r 0 , B 0 ) is the initial position and (dr/dt) 0 is the initial radial 
velocity for t = t 0 . 

The function <£(r) is positive or zero for r 0 (because ro(dr/dt)l ^ 0), 
and it is negative for r = 0 (if C^O), Le., there is a root r v be- 



Figure 5. Angle from Apocentron to Pericentron 
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tween 0 and r 0 . If w{r) — ► — oo when r— > <®, or if w(oo) is finite 
and h < — w{oo), then there is another root r 2 ^ r 0 . 

It can be proved that only these two positive roots exist; if 
h ^ — w{<x>) no second root exists. 

In the first case the orbits move between a minimum r x (peri- 
centron) and a maximum r 2 (apocentron) (Figure 5) and the 
angle 0 between the directions of the pericentron and apocentron 
is in general between 7 t /2 and i r (see [100 ] ) . If this angle is com- 
mensurable with 7 r the orbit is periodic. If however G/x is irrational, 
the orbit is not periodic and it “fills” densely the whole annulus 
between the circles r x and r 2 , i.e., it comes arbitrarily near any 
point of this annulus. Such an orbit is called ergodic. An ergodic 
orbit is also recurrent, i.e., it returns an infinite number of times 
near the initial point. 

In the second case the orbit extends to infinity. It is similar to 
a hyperbola or a parabola. 

In the case of a motion in the plane of an axially symmetrical 
system some other possibilities may appear. In this case <t>(r) may 
have a third root r 3 ^ r 2 and either r x ^ r 0 ^ r 2 or r 3 ^ r 0 . If r t ^ r 0 
r 2 < r s and r x < r 2 , the motion is represented by the rosette of 
of Figure 5. If r 2 < r 3 ^ r 0 , the motion extends to infinity. If, how- 
ever, r 2 = r 3 then we have an exceptional case. The orbit issuing 
from r 0 and moving towards r 2 approaches asymptotically the circle 
r 2 , along a spiral (Figure 6 ) but it does not reach it in a finite time. 
In fact then the integrals 

r r 2 rdr C ri Cdr 

Jro C <t>(r)) l/ 2 Jro r( 0 (r))^ 

diverge, because r 2 is a double root of </>(r). 

If <j>(r) has a double root we have 

(146) 0 (r) = (w(r) + h)r 2 -C 2 = 0 , 

(147) 0 '(r) = w'(r)r 2 +2r(w(r) + h) = 0. 

If now <f)(r ) is negative for values differing slightly from the above 
then we have a stable circular orbit. If, however <p(r) > 0 for slightly 
different values of r, then we have instability. This happens if 
0 (r) = 0 is not a maximum, i.e., if 0 "(r) ^ 0. Then 
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(148) u/'(r)r 2 + 4 rw'(r) + 2 (w(r) + h) ^ 0, 
and by means of Equation (147): 

(149) w" (r) + - W ^ — ^ 0. 

If this inequality holds, the corresponding circular orbits are 
unstable; i.e., a small perturbation will make them spiral outward 
or inward. For more details, see e.g., Lindblad [5], [103] and 
Coutrez [104]. 

The instability of the circular orbits has been used by Lindblad 
in order to explain the spiral arms of the galaxies. He remarked 
that such orbits can appear only if the oblateness of a spherical 
system (the quantity (a — b)/a where a is the major axis and b 
the minor axis) is greater than 0.72. But this is approximately the 
limit of the oblateness of the elliptic galaxies; systems with greater 



Figure 6 . Asymptotic Orbits 
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oblateness usually show spiral arms. However it is believed today 
that the spiral arms are probably due to other phenomena, and 
are not asymptotic orbits of the above type. 

Orbits of stars in different models of the Galaxy have been cal- 
culated by many authors [105], [106], [107], [108], [109], [110] 
[111], [112], [113], [114], [115], [116], [117], [118], [119]. Such cal- 
culations aim at finding the distribution of the pericentra and 
apocentra of the orbits, their periods, etc. 

Another approach to the problem of the plane motions of stars 
in our Galaxy has been introduced by B. Lindblad [120]. He con- 
siders all the motions, that do not differ too much from circular, 
as epicyclic motions. 

If r x is the radius of a circular orbit that has the same angular 
momentum as a given orbit, we introduce the coordinates 

= r - r x and Vl = r x d, 


where 6 is the angle between the directions of the star and the 
point moving along the circular orbit. 

The equations of motion are 


(150) 


d% 

dt 2 


C 2 

(n + fc) 3 


-K(r), 


where K(r) is the force per unit mass at the distance r u and 


(151) 


dr) X Cr x 

~dT = K+W~ Wiru 


where a>i is the angular velocity of the circular orbit. Hence 

(152) C = and K(r x ) = C 2 /rf = a>jf r x . 

If now we expand the second members of (150) and (151) in 
power series of £ x we get to a first approximation 

(153) -jtj T - *i£i = 0, 


and 


(154) 



where 


2o>i£i, 



218 

(155) 

and 


GEORGE CONTOPOULOS 


xjf = 3C 2 /rf + K'(n) = 4a> 1 (o) 1 - A,), 


1 ( d co \ 

as® ■’■--r-U), 

is the first Oort’s constant. 

The solution is then 

(157) £i = c cos K\(t — f 0 ) 

and 

( V 1/2 

— it)l - ) sin Ki(t - t 0 ). 
u>i- AJ 

These are the equations of an ellipse described by the moving 
point in its relative motion with respect to a point moving with 
the circular velocity, and the same angular momentum. If we drop 
this last restriction and take another circle of reference r 0 , where 
r i = + c u then we have approximately 

(159) £ = c { + ccos*(£ — t 0 ), 


( N. 1/2 

— ) s * n K ~~ 

because 

- 2 Ac, = r <fii 3 f - u> 0 ). 

Here k 2 = 4w(a> — A). 

This epicyclic motion can be used in deriving the velocity el- 
lipsoid (see [5]). The velocity components relative to the circular 
velocity at the point (£, 77 ) are approximately 


(161) 



c k sin k( 1 — t Q ), 


(162) v = + 2A£ = 2cBcoskU - f 0 ), 

where B = A — w is Oort’s second constant. Hence 



This relation has been derived already in a different way (cf. 
§111.2). Further work in the same direction has been done by 
Lindblad and others [5], [121], [122], [123], [124], [125], [126], 
[127]. Lindblad has considered especially a specific type of epicyclic 
orbits, the so called dispersion orbits, along which groups of stars 
disperse because of the differential galactic rotation (see [128], [129], 
[130], [131], [132]). These orbits have probably some connection 
with the spiral structure of our Galaxy. 

Some important numerical work concerning the motion of a sys- 
tem of rings on the galactic plane has been done lately by P. O. 
Lindblad [133]. He considers a number of massive points that form 
one or more rings around the center of the Galaxy (the Galaxy is 
represented by Schmidt’s model [134]). Then he considers the 
motions of these points under the combined attraction of the 
Galaxy and of the other points of the rings. It is seen that the 
rings are preserved for some 10 8 years but they disrupt before 10 9 
years in general. Then some formations resembling spiral arms are 
formed. Such a work is especially important because of its con- 
nection with the problem of the formation of the spiral arms. 
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b. Three-dimensional orbits . Calculations of three-dimensional 
orbits of stars in a galaxy were made by Contopoulos [135], 
Torgard [136], Ollongren [137], and Hori [138]. 

If we consider a meridian plane containing the axis of symmetry 
of the Galaxy (considered as ideally axially symmetrical), which 
follows a star in its motion around this axis, we get a two-dimensional 
motion of the star on this meridian plane. In the case that the 
motion does not extend to infinity, the energy integral defines a 
torus around the axis of the Galaxy inside which the three-dimensional 
orbit is contained. The section of this torus with the meridian plane 
is a closed curve. This curve is called a “curve of zero velocity 
because if the moving point reaches this curve its velocity is zero. 

The energy integral is 

(168) E = i(fl 2 + Z 2 + ^) + V, 

where V is the potential energy per unit mass. Therefore the equa- 
tion of the torus of zero velocity (R — Z = 0) is 


(169) 

C 2 /2r 2 + V=E, 


and if 



(170) 

v-K-Z + ee+v-tf- 

2 t£z 2 + •■■) 

with £ = r 

— r 0 , we have 


(171) 

P? + Qz 2 - 2a{ 3 /3 - 2 &£z 2 - • 

.. = 2 E. 

In this case Poincare’s theorem is applicable; i.e., the motions 
are in general recurrent. 

If now the orbits are also ergodic on the energy surface, they 


fill the whole space inside the curve of zero velocity. However, 
the existence of a third integral has as a consequence that the 
orbits fill only part of the available space. This is limited by a 
curvilinear quadrilateral (see [135]), which in our case is approxi- 
mately an equilateral trapezium (Figure 7). The orbits are distorted 
Lissajous figures and the equation of the boundary is given in the 
form of a series. 
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Figure 7. A Box Orbit 
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The same results appear in the orbits discussed by Ollongren 
and Torard, who calculated a great number of orbits in a model 
of the Galaxy given by Schmidt [134]. In general the orbits are 
contained inside a “box” whose apices lie on the torus section of 
zero velocity. This is a very strong indication that a third isolating 
integral exists in this case also and therefore in more general po- 
tential fields. In the case of a separable potential of the general 
form (104) the boundary is composed of two ellipses and two hyper- 
bolas (see [84], [138]). 

Ollengren and Torg&rd found some other types of orbits, the so- 
called “shell” and “tube” orbits. The “shell” orbits fill rings that 
have no point in common with the curve of zero velocity. The 
tube orbits fill elongated narrow strips that have, eventually, 
many folds (Figure 8). These orbits lie near stable periodic orbits. 

The calculation of three-dimensional orbits gives the periods of 
the oscillations along the £-axis and the 2 -axis. In the case of 
the orbits calculated by Contopoulos the axial oscillations have a 
mean period of about 24 X 10' years and the z-oscillations a period 
of about 9 X 10' years (see [84]). Therefore the ^-oscillations are 
realized in a time approximately equal to the period of rotation 
of the Galaxy, while at the same time the orbit makes 2.5 complete 
oscillations along the 2 -axis. 

Another application of these calculations concerns the dispersion 
of a group of stars in the Galaxy. If we have a group o f stars with 
slightly differing initial conditions, their velocities when the stars 
come again near the initial point are very nearly the same, or nearly 
symmetrical with respect to the axes £ and 2 . In fact for any given 
orbit at every point (£, 2 ) there are defined only two directions of 
motion and 4 velocity vectors, of which two are opposite to the 
other two. These directions are exactly symmetrical with respect 
to the axes if 2 = 0. 

This means that though the stars are separated because of 
their differential motions, whenever they return near the same point 
they have almost the same velocity vectors. Further, even if they 
have some dispersion in their velocities, their mean periods are very 
nearly the same (see [184]); therefore the stars of a group come 
near each other again and again during a time interval of at 
least some billion years. This fact may explain the persistence of 
the groups of stars, found by Eggen [139], [140], [141], [142), [143], 
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[144], [145], [146], [147] and Eggen and Sandage [148], for long 
time intervals (see [149]). These are groups of high velocity stars 
with a rather small dispersion in their space velocities, but not so 
near each other so that they are gravitationally bound. We have 
seen that the dispersion on the meridian plane is small. More ac- 
curately after the stars are dispersed they come again close together 
with almost the same velocity (or with opposite velocity, or even 
with an approximately symmetrical velocity with respect to the axes 
£ and z ) . 

The only marked dispersion of these stars is due to the galactic 
rotation. This effect has been discussed by Woolley [150], |151| 
and King [152]. According to Woolley the periods of galactic rota- 
tion of the stars of a group must be very nearly equal; i.e., the 
dispersion of their plane velocities must be very small. According 
to King, however, the radial velocities may not be so nearly equal. 
The stars originating at a given point come close together again 
after one oscillation in the radial direction. This view is similar 
to our own regarding the motion in the meridian plane (see [ 149 1 ) . 

The theory of epicyclic orbits in three dimensions has been dis- 
cussed by Chandrasekhar [ 153 ] and lately by Shimizu 1 154 1 and 
Emoto 1 1 55 1 . 

The projections of the three-dimensional orbits on the plane of 
symmetry of the Galaxy have been considered by Contopoulos [ 156 1 
and Yasuda 1 157 ] . The orbits corresponding to a spherical galaxy 
are osculating with the projections of the real orbits, in the same 
way as the Keplerian orbits osculate with the real orbits of the 
planets in the solar system. A system of canonical variables for 
the galactic orbits has been developed by Contopoulos and Bozis 
[158]; it permits the study of perturbations of galactic orbits due 
to the ellipticity of the system, or to external influences. The use- 
fulness of the methods of celestial mechanics in these problems is 
quite evident. 

A similar problem to that of the three-dimensional orbits in a 
Galaxy is the calculation of the orbits of artificial satellites around 
the oblate earth. In the meridian plane the orbits fill a certain 
space defined by a third integral; in the three-dimensional space 
the orbits fill in general a torus that is similar to the van Allen 
belts. An application of an approximate third integral of motion 
to the orbits of satellites was made by Diliberto, Kyner and Freund 
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[159]. Vinti [160] gave a separable potential that represents ap- 
proximately the potential of the earth and gives 3 isolating integrals 
of motion. 

c. Ergodicity . In the case of galactic orbits, that do not extend 
to infinity, in 2 or 3 dimensions, one can apply Poincare’s theorem 
and state that almost all moving points will pass an infinite number 
of times near the initial point. In fact, in these cases an invariant 
set ft of finite measure is always present. In the case of 3-dimensional 
orbits the section of the above set by the plane of symmetry of the 
Galaxy is an annulus between two circles of radii r x and r 2 . Its 
section with the 3-dimensional space is a torus generated by the 
curve of zero velocity. 

Poincare’s theorem, however, does not tell us if an orbit will 
fill the whole annulus or the whole torus. This can be decided by 
means of the so-called ergodic theorem. If the set ft cannot be 
separated into invariant parts ft l5 ft 2 , • ♦ ■ of positive measure, then 
it is called metrically indecomposable or metrically transitive. In 
this case an orbit fills the whole space equally densely. The moving 
point remains inside any given subset w C ft of measure m for a 
fraction of the total time that is asymptotically equal to the ratio 
m/M, where M is the measure of ft; i.e., if the moving point stays 
inside w for time intervals t x + t z + t 3 + • * * = t during the total time 
T, then the limit t/T for °° exists and is equal to n/M. 

This theorem was proved by Birkhoff [161 ]. Another proof, due 
to Kolmogorov, is mentioned by Khinchin in his book on the founda- 
tion of statistical mechanics [162] ( H ). 

The main difficulty of the ergodic theory is to prove that a given 
system is or is not metrically transitive. No such general theorem 
is available. Oxtoby and Ulam [165] proved what seems to be the 
most general theorem in this direction, namely that the measure 
preserving flows are metrically transitive “in general.” However, 
it is possible that the dynamical systems are exceptional in the 
sense of Oxtoby and Ulam. (See e.g., Diliberto, [166]). This pos- 
sibility is closely connected with the existence of some “hidden” 
integrals of motion. In fact if there exist a number of isolating 


( 8 ) For further reading see the monograph of Munster [163] in the Handbuch 
der Physik. Many mathematical problems of the ergodic theory are discussed 
by Jacobs [164]. 
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integrals the orbits lie always on the “surfaces” defined by the 
constant values of these integrals. If, however, only one or two 
integrals of motion are known, we cannot assert that the orbits will 
fill the whole space defined by these integrals, because it may be 
that another isolating or quasi-isolating ( y ) integral, not known 
already, is present. In fact if by calculating some orbits we find 
that they do not fill the whole permissible space defined by the 
known integrals, this is an indication of the existence of a further 
isolating or quasi-isolating integral (see [74], [85]). 

In this connection Moser [167], [168] and Arnol'd [169], [170] 
proved a very important theorem. Under some very general as- 
sumptions periodic orbits in a two-dimensional potential field are 
stable; namely in the 3-dimensional space defined by a constant 
value of the energy constant the orbits either lie on tori surrounding 
the periodic orbits, or they are contained between two tori of this 
kind. In both cases the orbits are not ergodic, i.e., they do not fill 
the whole energy “surface.” In our terminology these cases are 
either isolating or quasi-isolating. 

4. Some further problems. The numerical calculation of orbits in 
different potential fields is of the greatest interest, especially as 
regards the existence of further integrals. 

For example one should calculate some orbits in slightly elliptical 
systems such as the elliptical clusters, the elliptical galaxies and 
the oblate earth. In a spherical potential field the orbits fill, in the 
meridian plane, an area between two circles and two straight lines 
symmetric with respect to the plane of symmetry (Figure 9). For 
slightly perturbed fields the area filled by the orbit should be slightly 
different from the above. This has been found in some examples 
calculated recently by Contopoulos and Danby. However in some 
special cases of slightly elliptical systems the orbit fills a parallelo- 
gram (see [85]). A number of orbits have been calculated in some 
resonance cases of the third integral. In the expansion of the third 
integral in the form of a series there appear an infinite number of 
divisors of the form m 2 P — n 2 Q and some of them may be very 
small or even zero. When one such value is zero we have a resonance 
in the unperturbed motion (when we have two uncoupled oscillations 

( y ) A quasi-isolating integral is neither isolating nor ergodic; in general an 
orbit does not lie exactly on some isolating surface in the phase space, but is 
contained between two such surfaces (see [74]). 
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in two perpendicular directions) and the form of the third integral 
changes (see [152]). It has been found, however, that in the case 
of higher order resonances, (where P 1/2 /Q 1/2 is not equal to 1/1 or 
to 2/1), the forms of the orbits do not change very much. Only 
if P 1/2 /Q 1/2 is 1/1 or 2/1, do the orbits change appreciably. Such 
resonance cases should be considered in other problems also. 

zl 



Figure 9. Recurrence Region for Spherical Potential 

W e have calculated some orbits in the case of an axially symmetric 
system that has not a plane of symmetry. It is evident that a third 
integral exists in this case too. In the case of a system that has no 
axial symmetry, there are two integrals besides the energy integral 
(see [85]). These, however, do not correspond to the angular mo- 
mentum and the third integral of the axially symmetric case. 

One could think of other more complicated potential fields also. 
Such is the case of the 3-body problem or the rc-body problem. In 
all these cases the calculation of some orbits may indicate the ex- 
istence of new isolating or quasi-isolating integrals, whenever such 
integrals exist. 

Of special importance is the calculation of periodic orbits. The 
periodic orbits play an important role in classical celestial mechanics 
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and they should play a similar role in stellar dynamics. A few 
periodic orbits in the case of the potential field (131) have been 
calculated by Goudas and Barbanis [ 171 J. The initial velocities of 
these orbits are perpendicular to the plane of symmetry. We found 
also periodic orbits with nonvertical initial velocities. 

The calculation of orbits in a realistic potential field, representing 
our Galaxy is of importance for another problem also. In this way 
one could find the evolution of the distribution of stars of different 
ages in the Galaxy. E.g., one could find if the A stars were more 
concentrated in spiral arms in previous times, and so find eventually 
the age and the points of origin of these stars. This work necessitates 
accurate values of the proper motions of the stars as well as a good 
knowledge of the galactic field. The calculations of the perturbations 
due to the spiral arms is of interest in this connection. 

Another application of the third integral is in constructing models 
of stellar systems by means of a distribution function F that de- 
pends on three arguments, E , C, and $. Most of the models con- 
structed until now depend only on one or two arguments. Therefore 
it should be of great interest to have more general distributions, 
including the third integral also. 

IV. The statistical approach. 

1. Introduction. The main purpose of the statistical approach of 
stellar dynamics is to connect the two former approaches of particles 
and continuum. This is essentially the object of statistical mechanics 
in general. In fact in statistical mechanics a probability is defined 
for any given particle to be in any given region of the phase space. 
The distribution of a great number of particles is assumed to follow 
this probability law which gives a continuum distribution. 

This method, however, is not easily applicable to stellar dynamics; 
in fact a consistent definition of probability in the case of stellar 
dynamics has not yet been given. 

A “probability density” p is a positive quantity that is normalized, 
i.e., Jlpdw = 1 for all the phase space m, and it is an integral of 
the equations of motion. The last property is derivable from the 
following fact. If the probability that a particle belongs to a certain 
set w in the phase space is P, then after some time t the set w is 
transformed into (f>{w,t) and this has the same measure as w. On 
the other hand the probability of t) is equal to P, because 
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each point of w defines uniquely one point of <p(w, t). Therefore 
the probability density p = lim (P/m(w)) is constant along an orbit. 

We can define as probability density the quantity p = f/N , where 
f is the distribution function and N the total number of stars of 
a stellar system. Such a probability density would give only the 
known continuum approach of stellar dynamics. However, here we 
want to find /, as it is done, for example, in classical statistical 
mechanics, by using the ergodic theorem or assuming equal a priori 
probabilities. 

The form of f may be found by adding a new element in our 
picture, and this is the interaction of the stars among themselves. 
In the continuum approach we never used the fact that the indi- 
vidual stars which compose the continuum attract each other ac- 
cording to Newton’s law. If these interactions of the stars with 
each other are taken into account, we may be able to derive a 
“final” distribution function towards which any initial distribution 
of positions and velocities tends, or at least to find the evolution 
of a distribution function in the course of time, once the initial 
distribution function is given. 

This problem can be solved, in principle, by numerical integra- 
tion. This, however, is equivalent to solving completely the rc-body 
problem, and this is quite impossible in practice since the number 
of the gravitating bodies is very great. Therefore, we should rely 
on some general principles. 

The basis of statistical mechanics is the assumption of equal 
a priori probabilities for equal volumes of the phase space. In order 
to do the same in stellar dynamics it is sufficient to find an invariant 
set 17 of positive but finite measure; then a function p that is a 
positive constant inside 17 and zero outside it can be taken as a 
probability density. Kurth [4], [172], however, remarks that no 
such invariant set is known and probably does not exist; therefore 
no statistical mechanics of stellar systems exists. For this reason 
he rejects all statistical methods in dynamical astronomy, such as 
the notion of the relaxation time, the Boltzmann distribution etc. 

However there is at least one case, mentioned above (in §11.4) 
where we can define an invariant set 17. This is the case of the 
plane restricted 3-body problem. In this case a probability density 
can be defined. It should be very important to find similar cases 
in the n-body problem also. The results of Arnol'd [33] open new 
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possibilities in this direction. 

Further one may find criteria to distinguish between orbits ex- 
tending to infinity and recurrent orbits. Then we might apply our 
considerations to the set of the recurrent points only. 

But even if these possibilities are not realized, one can apply 
quite generally the methods of statistical mechanics to stellar dy- 
namics, under some “reasonable” assumptions, as a convenient 
means to study approximately the behavior of stellar systems. 
Such a discussion has been made by Heckmann [ 173 J. 

The statistical mechanics of stellar systems can be applied in a 
region of the phase space that is not infinite, whether this region 
is really invariant or not, if we have reasons to assume that the 
motions that extend outside this region are rather exceptional. This 
method is of course, only approximate, but it has the same value 
as the approximation of a stellar system by a continuum, or even 
by n gravitating bodies moving according to Newton's law. 

In fact the main argument of Kurth against the statistical me- 
chanics of stellar systems is that it contradicts the theorems of 
Hopf. The statistical mechanics considers the evolution of a stellar 
system as an irreversible process, i.e., as a one-way evolution toward 
a final state which we might call the state of maximum entropy. 
According to the theorems of Hopf, however, there is no such final 
state. The system is either recurrent or its representative point goes 
to infinity; but the second alternative happens only if the representa- 
tive point came from infinity. We may say more loosely that a 
system disintegrates only if it was formed by captures. A recurrent 
system, however, is almost periodic for infinite time, and no final 
state exists, unless it is periodic. 

This objection is quite similar to the famous “recurrence ob- 
jection” (Wiederkehreinwand) of E. Zermelo against the irreversi- 
bility of statistical mechanics. Zermelo’s argument, based on 
Poincare's theorem, states that the increase of entropy of a system 
is followed by a corresponding decrease of entropy as the system 
returns arbitrarily near its initial state. In fact, such increases and 
decreases of entropy alternate, and evolution of every physical 
system is oscillatory. 

A very interesting “conciliation” of the recurrence theorem 
with the classical views of thermodynamics is due to P. and T. 
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Ehrenfest [174]. The Ehrenfests point out (a) that if at some time 
the entropy is much less than its maximum value, it will practically 
always tend to increase, (b) that this is true for both time directions, 
and (c) that the entropy remains quite near its maximum value 
practically all the time. In other words the time interval during 
which a system is far from equilibrium is quite small in comparison 
with the time of equilibrium. Therefore, the actual Universe is a 
very improbable deviation from equilibrium. 

However, such deviations from equilibrium do exist, although 
they are very rare. In the case of a gas the time of relaxation, i.e., 
the time needed to reach equilibrium, is of the order of 10 _y seconds; 
whereas the recurrence time for one cm 3 of gas (10 18 atoms), i.e., 
the time needed so that each atom should return near its initial 
position with nearly the same velocity, has been calculated by 
Boltzmann to be of the order of 10 luly years! In the case of a stellar 
system the relaxation time is of the order of 10j to 10 14 years, 
while the recurrence time is at least of the order of 10 1,oo ° to lO 1,000300 
years as we have already calculated (in §11.5). Therefore, even if 
a stellar system is recurrent, the statistical methods can be applied 
if they deal with time intervals small in comparison with the re- 
currence time. 

The Ehrenfests’ approach has many advantages, but it has one 
main disadvantage in that it considers the two time-directions as 
essentially equivalent. However, the uniqueness of the time direction 
is one of the most basic principles of physics, of the same status 
as the principle of causation. 

On the other hand, the gravitational law of Newton and the laws 
of mechanics in general are symmetric with respect to the past and 
future. One can escape from the dilemma only by assuming a limited 
extrapolation of any physical law and the laws of mechanics in 
particular. One can practically never extrapolate the laws of me- 
chanics for intervals as long as the recurrence times. For example, 
the accuracy with which the changes of the angular elements of the 
planets are known, on which the accuracy of the law of Newton 
is based, is of the order of l"/century (see [175]). This corresponds 
to an inaccuracy of 90° in about 4 X 10 12 years, which is roughly 
of the order of one relaxation time but quite small in comparison 
with the recurrence time. 
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If we assume that Newton’s gravitational law changes slowly in 
time then our proof of Hopf’s theorem is no more valid. In practice 
there are many quite evident reasons why one cannot extrapolate 
Newton’s law for extremely long time intervals. The loss of mass 
of the stars, their friction with the interstellar matter, or the actual 
collisions of stars, completely change the behavior of a stellar system 
in time intervals quite small in comparison with the recurrence 
time. Fricke 1 176 1 has calculated the effects due to a resisting 
medium on the motion of the stars. From his formulae we find 
that for a density of the interstellar matter of the order of 10 
gram *, the velocity of a star is reduced to one half after a time 
of the order of a hundred to a thousand times of relaxation. There- 
fore, Newton’s law is applicable only for time intervals of the order 
of a thousand times of relaxation. In much longer times the small 
irreversible effects become predominant. Hence in the long run all 
the physical and astronomical phenomena show a definite time- 
direction. 

The theorems of Hopf are very useful as regards the mathematical 
theory of the n-body problem, but not as regards the behavior of 
real stellar systems. For these reasons the use of statistical methods 
in stellar dynamics is well justified; in fact in many cases these 
methods give us more information than particle mechanics. 

2. The Relaxation Time . As a first illustration of the application 
of statistical methods to the problems of stellar dynamics we shall 
give the main steps of the calculation of the time of relaxation of 
a stellar system. The first satisfactory calculation of the time of 
relaxation was made by Chandrasekhar [2j. 

There are several definitions of the time of relaxation. Here we 
shall use the following: The time of relaxation T E is the time needed 
so that the mean change of the kinetic energy of a star due to 
encounters is equal to its initial kinetic energy. Namely, if E is 
the kinetic energy of a star and SE is the energy exchange in each 
encounter, then in time T E we get the equality 

(172) ZUE) 2 = E\ 

This time is found in the following way: Each encounter is con- 
sidered approximately as a two-body problem and (A E) 1 is cal- 
culated; then we take the contribution of all the encounters suffered 
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by a star in time At. The sum AE ) 2 becomes equal to E 2 when 

At = Te* 

Let v g be the velocity of the center of mass of two stars with 
masses m u m 2 and velocities v u v 2 ', i.e., 


(173) v g = + m 2 v 2 ). 

ni\ ~r vft 2 

On the orbital plane (where the relative motion of the two stars 
takes place) we have the relative velocity 

(174) 


Hence 

(175) 


— mii; 

"2 = : 

mi + m 2 


+ v g . 


The relative velocity does not change in measure during the en- 
counter, but it is deflected by an angle tt — 2^ (Figure 10). D is 
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the distance of m 2 from the initial velocity vector of nil and is 
called the “impact parameter”. If 


(176) 


a 2 b 2 


is the equation of the relative orbit, then 

tan 2 \p = b 2 /a 2 , 


(177) 
and 

while 

(178) 
where 

(179) 

Therefore 

(180) 
hence 
(181) 


b = D, 

b 2 /a = p = C 2 /(G(m i + m 2 )), 

C = The constant of areas = Dv. 
G(m, + m 2 ) 

a = o » 


tan> 


DV 

G 2 (m, + mj 2 ' 


Then (see (175) and Figure 11) we have 

_ _ _ m, 


V2g=Vg-Vl = 
v' lg = Vg-v ' 2 = 


fTl 1 -f- 7712 
/7Tj “j - m 2 




ir . 


As v and v f have equal length, so do and v 2 *. 

Let $ be the angle between v g and v (Figure 11); then 


(182) 


vj = v 2 + 


2m! 


+ m 2 

Similarly after the encounter 


v B u cos4> + 


\m, + m 2 ) 


2m! , , / mj V 2 

(183) oi‘ - v‘, + »,«cost + ■> • 
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because the length of v remains the same. Hence the change 
in kinetic energy during the encounter is 


(184) A E = - m 2 (v 2 2 — v'i) = mim2 VgV(cos$' — cos4>). 

2 m x + m 2 

If i is the angle between v HP , the projection of v g on the orbital 
plane, and v g and 0, 0' the angles between u , v f and v gp (Figure 12), 
then 

(185) cos4>= cos <t> cost, cos4>' = cos 0' cos L 

But 

(186) 0' — 0 = 7T — 20. 

Hence 


(187) 


0 + 0 ' 
2 


therefore 

cos#' — cos4> = cos i (cos <f>' — 
(188) 


2 “* + *’ 


— V 0 . 0 + • 0 0 

cos 0) = 2 sin — - — sin — — — cos i 
z z 

= — 2 cos i cos(0 — 0) cos 0 


and 


(189) 


aE = - 


2m x m 2 
m , + ^2 


cos(0 


0) cos 0 cost. 


Let us calculate now the number of encounters of a star with 
velocity v 2 with other stars. If (^,0,0) are the polar coordinates 
of the point v x in a coordinate system having as 2-axis the velocity 
v 2 (0 is the angle v Y v 2y and 0 is the azimuth of the plane v x v 2 ). Let 
N{v u 8,<f>) dVidd d<f> be the number of stars with velocities in the 
range {v u + dv { ), (0, 0 + d0), (0, 0 + d0). The number of stars 
with impact parameters between D and D + dD encountered in 
time dt is proportional to 2* DdDvdt. 

If O is the angle between the orbital plane and the plane u t v 2y then 
the proportion of stars having this angle between 0 and 0 + d 0 
is dO/2ir. Hence the change of energy of v 2 in time dt , due to en- 
counters with stars with the parameters v u 0, 0, D , O in the above 
range, is 
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£(A E)'\v u e,<t>, D, G) 


(190) 

= {±E) 2 2*N{v u e,<t>) vDdD 


de 

2i r 


du 1 d8d4) dt 


= AN (v u 6, <t>) V y ( ) CO&H cos \4> - i) cosV DdDde dv x de d<j> dt. 

Now we must integrate over all the values of D, 0, 6, </> and v x to 
find the total change &E ) 2 in time dt. By (181) 


(191) 


DdD 


G 2 (m x + m 2 ) 2 sin^ 

v 4 COS'')/' 


d 


and for D — > 0 we have ^ > 0 (because the deviation - 2^ then 
tends to 2ir), while for D — ► °°, ^ — >t/ 2 (the deviation tends to zero). 
Hence 


X(A£) 2 (ei,0,0, 0) 

(192) 

Va r ~ 

= 4 N{v u d,<t>) G z mtmiy cos 2 * I cos 2 (0 - \P)tain\pdtdQdv [ ddd<t>dt. 


This becomes infinite logarithmically as ^ 0 -^tt/ 2. However when 
D is great we cannot consider the encounters as two-body prob- 
lems. If D 0 is the distance of the neighboring stars, and D » D 0 there 
are many encounters at the same time which probably cancel 
each other. On the other hand if D < D 0 the formula (192) over- 
estimates each {±E) Z because an actual encounter lasts less time 
than in the case of a two-body problem. For this reason Chandrasekhar 
takes D = Dq as the upper limit of the impact parameter. It is 
to be noted that a change of this limit by a # factor of 2 does not 
appreciably change the value of (SE) 2 {v h 8, 0, 0) because it con- 
tains the term 


r + u 

I cos 2 (tf> — \p) dlogcos^, 

and this does not change very much if tan \p 0 changes by a factor of 2. 

This is the most delicate point in the calculation of ^(sE) 2 . 
The integrations with respect to 0, 8 , $ are made relatively easily 
if the distribution of velocities is assumed to be spherical (i.e., 
N(v u 8 j( t)) = N(vi) sin0/47r), and if we drop all the numerically un- 
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important terms. Finally the integration with respect to v t is effected 
by assuming a Maxwellian distribution of the velocities v lr i.e., 

(193) N(vi) dvi = N exp( — j u vf) ufdv u 


where N is the number of stars per unit volume. 
Finally we find 

T(ae y dt 

(194) e~ = t m 

where 


(195) 


Here 


vi 


S2ir NG 2 miG(x 0 ) In 


D 0 vi 


G(m, + m 2 ) 


(196) x 0 = jv 2 and G(x„) = — ^(H(x 0 ) - x 0 H'(x 0 )), 

ZXq 


where H is the error function 


2 r x u 

(197) H(x 0 ) = — z exp( - x 2 ) dx . 

7T JO 

Hence if dt — T E then £(AE) 2 = therefore T E is the time of 
relaxation. 

For our Galaxy N = 0.1 star/pc^, Z) 0 = 2.7 pc, ra^O.5©, v 2 
= 20 km/sec, j = 0.032 sec/km and we find that 

T e = 7 X 10 M years. 

This quantity is big enough so that the individual orbits of 
stars do not change appreciably because of the encounters. During 
the revolution of the sun around the center of the Galaxy (2 X 10 h 
years) the mean change of the energy of a star is about 0.0017E. 
Correspondingly the mean change in the direction of the motion 
of a star is A\p ^ 0P07. For this reason we have neglected the 
stellar encounters in calculating the orbits of stars in a stellar 
system. 

If the velocities v 2 also have a Maxwellian distribution we find 
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(198) 


j B l_ / 3 \ 1/2 (»f) 3/2 _ 

E 16 NG ^ m ^ ln(D 0 V2/2Gm) , 


if the masses of all the stars are equal. 

In the case of a spherical cluster we may apply the virial theorem 
and have 


(199) 


v 


2 


Gmn 

2R* 


where n is the number of stars, and R* = r * / 2 is the value used 
by Chandrasekhar as the radius of the cluster. If we draw around 
each star a sphere of radius D 0 / 2, the volume of all these spheres 
is approximately equal to the volume of the cluster; i.e., 


(200) 

4 ( D °\ 6 

3* \ 2/ n 

hence 


(201) 

^-= 2n 

R* 

Also 


(202) 

i icR* 3 N = 
3 


Setting these values in formula (198) we get 


(203) 


T e 


1 / 3tt\ 1/2 / nR* 6 \ ia 1 
16 \ 2 / \ Gm / ln{n/2' i Z ) * 


As an example, the time of relaxation of the Pleiades is T E 
= 2.9 X 10' years. Therefore in this case the time of relaxation 
is much shorter than the age of the cluster; i.e., the encounters 
must have already achieved a statistical equilibrium. The same 
thing happens in the central parts of the globular clusters also 
(see [177], [178]). 

Many astronomers like Charlier, K. Schwarzschild, Rosseland, 
Heckmann and Siedentopf, Spitzer, and Fricke have worked also 
on the problem of the time of relaxation. Fricke [179] has pointed 
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out some errors in the earlier discussions. However, Fricke’s method 
also (which is presented in von der Pahlen’s book [3]) is not con- 
venient, as pointed out by Heckmann [180 J. 

Chandrasekhar has made another approach to the same problem 
using purely statistical methods (see [181]). He finds almost exactly 
the same values for the time of relaxation. 

The above calculations, however, consider only the interactions 
of stars among themselves. Some authors have considered also the 
effect of the interstellar medium on the relaxation of stellar veloci- 
ties. Spitzer and Schwarzschild [182], [183] have studied the effect 
of the interstellar clouds on the motions of the stars. They found 
that because of the encounters of the stars with interstellar clouds 
the time of relaxation is shortened very much and is only some 
10 ri years for our Galaxy. This is due to the fact that T E is inversely 
proportional to m 2 N, where mN is the mass density. Hence if the mass 
density remains the same, but instead of stars of mean mass 0.50 
we have big clouds of mass m = 10 J O, the time of relaxation is 
2 X 10 5 times smaller. Only for the high velocity stars is the time 
T e still big enough, since it is proportional to v'j' 2 . 

Osterbrock [ 184 1 found similar numerical results by assuming that 
the stars move in a fluctuating interstellar medium. Similar con- 
clusions are also reached by Ogorodnikov [185]. 

A few years ago Henon [186] calculated the time of relaxation in 
a way different from that of Chandrasekhar. He considered the 
distant encounters as perturbations instead of two-body encounters. 
This is more reasonable than completely disregarding the distant 
encounters. 

Henon’s formula is similar to that of Chandrasekhar, except that 
the logarithmic term of Equation (203) is replaced by 

\n(2v A T/e 2 G(m l + m 2 )). 

This means that the perturbations £ (A E) 1 are not proportional 
to T, but increase initially a little faster than T. This, however, 
seems to be due to the method of calculation used and not to a 
real physical phenomenon. 

When uT becomes ^ A, where A is the “Jeans length” (the di- 
ameter of the condensations formed by gravitational instability 
(see [69, p. 348]), then vT is replaced by 


PROBLEMS OF STELLAR DYNAMICS 


241 


(204) 


X = 


irV 2 Y' 2 _ 2tt r* 

3 GNm) 3 


^2 r*; 


r* is the_radius of the cluster given in §11.1 by the formula r* 
= GM/v 2 , and Nm = 3M/(47rr* 3 ). Therefore the “Jeans length’’ 
is roughly equal to the diameter of a cluster. Henon gives D 0 by 
the formula 


(205) D 0 = N - 1/3 = 


3 n 
4tt r 


*3 


i 3 


r*n ia <§:2 r*. 


hence 


X»D 0 . 


The logarithmic term in Equation (203) becomes 



i.e., Chandrasekhar’s time of relaxation T E must be multiplied by 
about 2/3. The change is unimportant. 

The same formulas apply also in the case of a plasma (see [187], 
[188]). In this case 


(206) 


/ kT \ 1/2 
\8irNe 2 ) 


( T is absolute temperature, k is the Boltzmann’s constant, N the 
particle density, and e the electron’s charge), and this is the so- 
called “Debye length,’’ i.e., the distance over which the Coulomb 
forces are effective in a plasma. At greater distances a screening 
effect due to charges of opposite sign does not permit the inter- 
action of the charges. 

In the case of a stellar system this screening effect does not appear, 
and therefore X is equal to the diameter of the system (see [189]). 
However it should be of interest to find out the action of distant 
perturbations, in connection with the problem of the fragmentation 
of a system into clusters of stars. Jeans’ study refers to instabilities 
that occur in a continuum if the wavelength of a density variation 
exceeds a critical value which is the Jeans length. Similar work 
was done lately by Lynden-Bell [190]. The problem now is to find 
how far these considerations apply to real stellar systems composed 
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of interacting stars and not only to continuum media. Layzer 
[191] has recently criticized Jeans’ results; he points out that 
nonlinear effects probably do not permit fragmentation. 

In this connection one should consider the problem of formation 
of condensations of different sizes (galaxies, stellar clusters, stars) 
especially in rotating systems. It is not quite clear if formula (204) 
can be applied also when v refers to a rotating coordinate system, 
i.e., if it is the velocity with respect to the local standard of rest. 

Some experimental calculations of the time of relaxation in a 
plasma have given values that are 1000 times smaller than cal- 
culated theoretically. This is the so-called Langmuir’s paradox 
(see [192], [193]). Fricke [66] has suggested that perhaps forces 
other than Coulomb’s play a role here. A few years ago, however, 
Gabor, Ash and Dracott [194] showed that some oscillatory fields 
are generated at the walls of the plasma containers adding energy 
and entropy to the electrons, and thus causing a Maxwellian 
distribution to be reached very soon. 

A problem similar to that of the time of relaxation is the problem 
of dynamical friction (see [195], [196], [197], [198], [186]), i.e., 
the continuous decrease of the velocity of a star along its orbit due 
to interactions with other stars. 

Chandrasekhar’s time of relaxation refers to systems with 
constant density. In actual systems, however, the density de- 
creases outwards. Therefore the time of relaxation in the outer 
parts of a stellar system is larger. Consequently a system cannot 
be very approximately in equilibrium except in its central parts. 
Further, the time of relaxation for a circular orbit is larger than 
for a radial orbit that has the same apocenter. Calculations of the 
time of relaxation in different models of star clusters were made 
by Woolley and Robertson [200], [71] and by King [201]. King 
finds a mean time of relaxation 4 times greater than given by 
Chandrasekhar’s formula, for clusters with a central condensation. 

3. The Escape of Stars from a Cluster . One of the most important 
problems in the evolution of a stellar cluster is the escape of stars 
from it and the subsequent dissolution of the cluster. This problem 
has been studied by a number of authors, including Chandrasekhar, 
King, Henon, von Horner, etc. In order that a star should escape 
from a cluster its total energy must be positive or zero; i.e., its 
velocity must become equal or greater than where 
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(207) 


o m <vl= G^T 


minij 


j=*l;i*j ' ij 


(j ^j). 


and further encounters should not reduce the velocity at any point 
below the corresponding v,„. We may omit the latter effect as of 
unlikely occurrence in general (see [199]). 

Adding all similar terms we find 


(208) 


1 » 1 

- X m iVt= 7: mnvl= - 2v = 

2 i=l 2 


Gm 2 n 


or ui = 


2 Gmn 
~R ~ m 


On the other hand from the virial theorem we have v 2 = Gmn/2R* y 
hence 


(209) u‘i= 4 v 2 

In a Maxwellian distribution of velocities 

(210) dn = ex P( — J 2 ^ 2 ) v 2 dv , 
we have 

(211) u*=3/2j 2 , and 
Hence the fraction of stars 


(212) Q = A?2 f exp(— x 2 ) x 2 dx = 0.0074 

7T J 6 1 ' 2 

have a velocity greater than the velocity of escape. 

As the stars escape, however, new stars acquire velocities 
greater than the velocity of escape because of the encounters, 
and so we have a continuous escape of stars. It is argued that 
during one time of relaxation, T Ey a Maxwellian distribution is 
restored in a cluster and therefore in n x times of relaxation a fraction 
of A n/n = n^Q stars will escape. Hence in general the proportion 
of stars escaping in time Af is 

(213) — = 0.0074 

" t e 

In other papers Chandrasekhar [195], [196], [197] calculates 
the probability Q(r) that a star will escape from a cluster in time 
r. He finds first that if we omit the action of dynamical friction, 
the cluster is dissolved completely in a few times of relaxation. 
An open cluster is dissolved in a few 10 8 years. If, however, dy- 
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namical friction is taken into account, the lifetime of the clusters 
is much longer. Chandrasekhar finds then that Q(t) = 1 — exp(— r/T 0 ) 
where T 0 is the half-life of a cluster equal to T 0 — 266 T £ ( lu ). Hence 
the half-life of the Pleiades instead of being of the order of 3 X 10' 
years becomes of the order of 3 X 10 y years. 

Recently the problem of the ejection of stars from a cluster has 
been considered in detail by many authors. 

King in a series of papers [199], [201], [202], [203], [204], made 
some improvements on Chandrasekhar’s theory. He considered the 
variation of the time of relaxation Te with time. As the stars escape, 
the remaining cluster contracts continuously and Te decreases. 
Therefore the total life-time of a cluster is found to be only about 
40 Teoj where T E0 is the initial time of relaxation. 

Similar results were found by von Horner [205]. He finds a dis- 
solution time that is about 307\’o* The difference between King and 
von Horner is that King assumes that escaping stars take almost 
no energy with them, whereas von Horner assumes that they take 
a fraction of the total energy equal to 0.0052 (equal to the frac- 
tional energy of the escaping stars in a Maxwellian distribution, 
when they reach an infinite distance from the center) during one 
time of relaxation. Earlier L. Spitzer [206] supposed that each star 
takes away an amount of energy mv 2 / 2 where v 2 is the mean square 
velocity. This is, of course, an overestimate. Probably King is nearer 
to the truth, because a star escapes just as it acquires the escape 
velocity and probably it takes away only an insignificant amount 
of energy. It is evident that the Maxwellian distribution cannot be 
valid for velocities bigger than the escape velocity. 

After most of the stars of cluster escape there will remain probably 
only a stable multiple star, or even a double star. King found that 
if this double star gets all the energy of the cluster it will be just 
a normal double star. 

Henon [41], [42] has made a critical study of the escape mech- 
anisms of a star from a cluster. He points out that a gradual in- 
crease of stellar velocities cannot result in the ejection of stars; 
because if a star gets a velocity near the velocity of escape, it 
moves in a very elongated orbit and only rarely comes near the 
center where encounters take place. In fact, the star never acquires 


( iu ) The formula used by Chandrasekhar is To = 133^) \ where vo — ^Te- 
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the escape velocity if it is only subjected to a continuous set of 
very small perturbations. Only during a very close encounter can 
a star acquire abruptly the escape velocity. Such encounters, how- 
ever, are very rare. 

Henon finds that the escape rate is given by the formula 


( 214 ) 


dn 

dt 


- 0.00426 


Gmn \ l/ 2 

) 


for the Plummer model, where n is the number of stars and r 0 is 
the radius including, in projection, one half of the total mass. In 
the case of a globular cluster (n = 10 °) this quantity is about 150 
times smaller than the value 


( 215 ) 


dn 

dt 


- 0.125 (log 10 n - 0 . 45 ) 


Gmn 

H** 


derived from Chandrasekhar’s formulas, as given in [2]( u ). It is 
also about 40 times smaller than the corresponding value given 
by King. Similar calculations were made recently by Woolley and 
Dickens [207]. 

Another important factor that influences the rate of escape of 
stars from a cluster is the effect of the tidal field of the Galaxy 
and of the passing-by stars and interstellar clouds. This problem 
also has been considered by a number of authors. 

The effect of the tidal force of the Galaxy has been considered 
by Bok [208], Mineur [209], Chandrasekhar [2], von Horner [205], 
King [210] and others. In order that a cluster should not be un- 
stable under the tidal forces of the Galaxy, its density must be 
higher than a certain limit, which is about 0 . 1 © /pc 3 . This condition 
is realized in the real clusters; e.g., in the Pleiades the density is 
about 15 times the critical density. However, if a star goes out 
farther than a limiting distance from the center of the cluster it is 
usually lost from it because of the tidal forces. The limit according 
to von Horner is 


( 216 ) 


r s = Ro 


M \ 1/3 

2M g ) ’ 


where M is the mass of the cluster, M g the mass of the Galaxy 


( ) If we insert in (215) the value of G in units of parsec, solar mass and year, 
we get the formula (5.314) of Chandrasekhar. 
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and Ro the distance from the center of the Galaxy. If we set M g 
= 2 X 10 u O and R 0 = 10 kpc, we get 

( M \ 1,a 

<217) '■■- 13 ' 6 (iooo) • 

where r s is given in parsecs and M in solar masses. 

If the radius of a cluster approaches this limit, its dissolution 
is very fast. For R* = 0.5r s von Horner finds a time of dissolution 
equal to 87V If i?* is smaller the life time increases up to the 
limit 307V 

The effect of the encounters of a cluster with other stars on the 
rate of ejection of stars is small. However it has been found (Spitzer 
[211]) that clouds of interstellar gas going by a cluster exert a 
much greater influence than passing-by stars, namely about 30 
times greater. This effect is of the same order of magnitude as the 
tidal effect of the whole Galaxy on a cluster. Thus the disruption 
times of the actual clusters are of the order of 10 8 to 10 y years. 

King [203] has considered both the contraction of an open cluster 
due to the ejection of its stars and its gradual expansion due to 
the effect of the tidal forces. The loose clusters tend to be dis- 
solved by tidal forces, while the dense clusters are dissolved mainly 
by internal encounters. King finds a critical radius r crit (about 
lipc for a cluster of 100 stars), that separates the clusters into 
2 classes. The clusters with r < r crit are contracting, while those 
with r > r crit are expanding as their energy increases due to en- 
counters with clouds. In both cases the maximum lifetime of an 
open cluster is of the order of 10 y years. 

In a recent paper King [210] has considered three factors that 
define the structure of a cluster; namely the number of its stars, 
its energy, and the effect of the galactic tidal field. These correspond 
to the three parameters that enter the empirical density law of a 
cluster given by King 

u [ 1 1 1 2 
(218) P - k y (1 + (r/rc) y/2 _ (1 + (r,/r c ) Y' 2 _ 

In this formula r t gives the cutoff due to the galactic tidal field, 
r c gives the concentration of the cluster, and k is proportional to 
the number of stars. Formula (218) was shown by King to represent 
remarkably well the real density distribution of the globular clusters 
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and probably of the galactic clusters and dwarf elliptical galaxies 
also. The encounters do not seem to change the general structure 
of the clusters; they only slowly change the parameters entering in 
the above formula. King ascribes the similarity of the clusters to 
an initial relaxation process due to mixing during the formation of 
the clusters. However, the existence of 3 arbitrary parameters in 
formula (218) makes it possible to represent many density distribu- 
tions with sufficient accuracy. 

As yet there is no complete theory of the dissolution of the 
clusters that includes all the relevant factors. The main practical 
problem is to combine the effect discussed by Henon (that en- 
counters do not usually result in escapes of stars, but only cause 
some stars to move to the outer parts of the cluster), which tends 
to increase the lifetimes of the clusters, with the tidal effects that 
tend to reduce these lifetimes. Only when we have reliable lifetimes 
for the clusters we can compare them with observations, and 
derive cosmogonic consequences from them. 

4. The Evolution of Stellar Systems. When one calculates the time 
of relaxation or the rate of escape of stars from a cluster, he usually 
assumes the density function as given. For example, Chandrasekhar 
assumes a constant density, King assumes a polytrope, etc. There- 
fore one assumes some “reasonable’' model, found by means of the 
continuum approach of stellar dynamics, and then he proceeds to 
find the change of this model with time. 

However, the problems of the structure and evolution of a cluster 
are not separate. The present structure of a cluster is the result 
of its evolution. Therefore the statistical approach of stellar dy- 
namics should derive the distribution function f and its change in 
time from the interaction of stars among themselves. For example, 
one should prove a kind of Boltzmann’s H-theorem (which states 
that any distribution of energy tends to a Boltzmann distribution, 
and the corresponding distribution of velocities tends to become 
Maxwellian). 

These problems are not easy. We have not even succeeded in 
general in defining a probability density in the phase space of a 
stellar system. Further, a Boltzmann distribution corresponds to 
an infinite radius and mass. However there are some methods to 
escape from that difficulty. 

First, we may consider the fact that the time scale of the evolu- 
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tion of a stellar system is different in different places. F or example 
in the center of the observed clusters the time scale is comparatively 
short and a kind of statistical equilibrium has probably been reached. 
This equilibrium is slowly reaching the outer parts. However we 
may consider possible exceptions to this rule. Are all possible con- 
figurations of a cluster tending to the same equilibrium condition? 
It is quite probable that there are exceptional cases where such an 
equilibrium is never reached, even in the central parts of a cluster. 
Such are, for example, the cases where the motions of all the stars 
of the cluster are periodic. What happens then to systems that are 
“near” these exceptional cases? Do they reach an equilibrium, and 
in what time scale? For actual problems the calculation ol a rough 
time scale of the evolution is quite necessary. If this time scale 
is extremely long we may consider a cluster as not changing at all. 

Ogorodnikov [185], [212] considers stellar systems embedded in 
a greater stellar system that acts as a container. Such is the case ot 
a Galaxy that is surrounded by a halo. The halo exchanges mass 
with the Galaxy and when a steady state is reached, the number 
of stars absorbed by the halo is equal to the number returned to 
the Galaxy. Then a probability density can be defined and a statisti- 
cal mechanics of the Galaxy can be developed. 

This approach is good for short time intervals. It is not completely 
satisfactory because we cannot consider the halo as a given boundary 
and as a matter of fact it is a part of the Galaxy itself. However, 
until a better theory of statistical mechanics of stellar systems is 
devised such approaches may be useful. 

A quite different approach is due to Chandrasekhar and von 
Neumann [181 1, [213], [214|, [215], [216|, [217 1, |218). They start 
from genuine statistical considerations; namely they introduce an 
a priori probability density for each star r*(r„ uj, so that the proba- 
bility that a star lies between r, and r t + dr t and has a velocity 
between v t and v l + du l is T l dx lX dx il dx i: 4^i\dv l2 dv l ^ Then the total 
probability density in the phase space is T = where N 

is the number of stars. 

Chandrasekhar and von Neumann used for t, the function 
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and more generally an arbitrary function of vf and of the mass 
m. However T is not an integral of the equations of motion 
(see e.g., R. Kurth [172] ) because if 

j 6 

(220) T = — jpz exp ( — j 2 (v'i + v'i + ■ * * + ujv) ) = constant, 


then vf + v'i + • • • + v 2 N = constant, and this is not generally the 
case. Therefore this approach is not completely satisfactory, be- 
cause the probability density changes along the orbit of each star. 

However it has not been proved that a function t, with the re- 
quired properties does not exist. On the other hand one may not 
be interested in individual stellar orbits at all. In fact in most 
cases we may consider a stellar system as a gas where the motions 
of the particles are subjected to perturbations quite similar to the 
Brownian motions of the particles of a fluid. 

Chandrasekhar has calculated that the variations of the force 
acting on a star due to stellar encounters have a period of the 
order of 6 X 10 4 years in our Galaxy; therefore they are very fast 
in comparison with the period of galactic rotation of 2 X 10 a years. 
Thus we may consider the force acting on a star as due to the 
smoothed out distribution of the stellar system plus a rapidly 
fluctuating force, as in the case of Brownian motion. 

In order to describe the Brownian motion a generalization of 
Liouville’s equation is usually used that is called the Fokker-Planck 
equation (see e.g., [218]). If / is the distribution function, its change 
in time because of the collisions is given by the equation 

{%) \t + 0 (At 2 ) = A*.» 

\ dt > c *r; <>x, 


( 221 ) 


-Y Y - 

+ 2 hi hi 


(f(&Xi±Xj)) 


+ 0 ((Ax.-AifAx*)), (^e = 1,2,3), 


where (Sx t ) is the mean value of Ax, over the time A* and is given 
by the formula 



Ax,i/(x; Ax) d( Ax), 


(222) 
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where ^(x;Ax) is the transition probability that x will increase 
by Ax = (A*!, Sx 2 , Ajc 3 ) in time A£, and the integral is, in fact, triple. 
Similarly, 

(223) (Ax, Ax,) = | AXj Ax, ^ ( x ; Ax ) d ( Ax ) , 

etc. 

Liouville’s equation is replaced by the generalized Fokker-Planck 
equation 

(224) Tt + £ 

Ot | = 1 


df . df dV 

dX, 1 dX, dX. 




This equation corresponds to Boltzmann's equation in gases. If 

(225) $(x; Ax) = — -^^y-^exp I - I Ax + fix~At\*/{4qAt) j. 

(where the constant q is equal to fikT jm in a gas, and — fix is the 
dynamical friction), then (Ax,) = —fix t St y (Ax t Ax,) = O(aC) for 
i ^ y, and (A xf) — 2q At + O(aC). 

Hence 


(226) 

i.e., 



At T O(A^) 



(227) 



+ </Z 

1 


dxf * 


The Fokker-Planck equation now becomes 


(228) 


dt 


+ E 



(If d V J, Hfx t ) JL. (i'-f 

= /t*L — 77- + 

dx t dXj J “ dx t TTi dxf 


Equations (224) and (228) have been used by a number of 
authors in discussing the dynamics of an ionized plasma (see |187|, 
[219], 1 220 1 , [221]). These problems are very similar to those of 
stellar dynamics. 

A special application of Fokker-Planck’s equation to stellar dy- 
namics has been made by Chandrasekhar |195|, |196|, ] 197 1 in his 
study of dynamical friction. Another special case has been con- 
sidered by Spitzer and Harm [ 1 89 j , in their calculation of the escape 
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of stars from a cluster. A generalization of their solution was given 
by King [204]. 

A more general application of this equation was recently made 
by Henon [222]. Henon found a very important model of a stellar 
cluster that is homologous; i.e., this model is such that its evolution 
does not change the structure of the cluster but only its dimensions. 

It is known that a stellar system cannot in general have a final 
equilibrium state, because the Maxwellian distribution implies an 
infinite radius and mass. And it is only in exceptional cases that 
a system may tend to a configuration where all the orbits are 
periodic. However, it may be that a system tends to a homologous 
configuration; i.e., its final evolution is homologous. Henon has 
given examples where models initially different from the homologous 
model tend towards it. This is a case quite similar to the famous 
H -theorem of classical statistical mechanics, which states that 
the collisions tend to produce a Boltzmann distribution of the 
energy. 

The homologous model found by Henon has a finite mass and 
radius but its central density is infinite. However, the mass in 
any small volume around the origin is always finite. Henon con- 
sidered also some nonhomologous models which tend to the homolo- 
gous model, and their central density tends to increase so that it 
becomes infinite in finite time. 

In all these models the distribution function is a function of 
time and the energy integral only, f = f(E,t). Further work 
should include the other integrals too. 

A numerical solution of Fokker-Planck’s equation for a special 
form of the distribution function, /=/(£, C), including also the 
angular momentum integral, was given by Michie [223]. Michie also 
found a contraction of the core of the cluster, together with an 
expansion of its outer parts. Similar results were found lately by 
von Horner [10] by means of numerical calculations of the n - body 
problem. 

The effects of the encounters on the evolution of a stellar system 
may be divided into two classes; the effects due to close encounters 
(binary encounters), and those due to distant encounters. It is 
understood that the latter effects are more important than the 
former (see [187], [224]). A method that permits one to take into 
account not only binary encounters, but ternary and more complex 
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encounters also, has been developed by Bogoliubov [225], Born and 
Green [226], [227], [228], [229], Kirkwood [230], [231], Yvon [232] 
and others [233], [234], especially in the case of a plasma. It should 
be very useful to develop this theory for stellar systems also. This 
problem is intimately connected with the problem of the time of 
relaxation. Further the change of the distribution function due to 
encounters ( df/dt) c in Fokker-Planck’s equation may be derived 
more accurately by such a method. 

As we have indicated above, the rate of ejection of stars from a 
cluster is not yet accurately known. The theory must explain 
quantitatively the extremely low escape rate indicated by von 
Horner’s numerical calculations [9], [ 10 ] . Further, the ejection 
of stars of different masses, the effects of the rotation and ellipticity 
of a cluster on the rate of ejection must be derived. 

One very important problem is to find out if the stellar clusters 
tend always towards an homologous cluster. 

Another problem is to find criteria that would indicate the age 
of a cluster; the present estimates of the ages of the clusters are 
in general based only on the evolution of different stellar models. 
It would be very important to have independent dynamical ages 
of the clusters. 

There are many statistical problems concerning specific stellar 
systems, clusters, galaxies or clusters of galaxies. The statistical 
methods used in such problems are described, e.g., in the book of 
Trumpler and Weaver 1 1 10 ] . 

Statistical methods are introduced also in problems ot celestial 
mechanics, in connection with the distribution ot the asteroids, 
the comets and the artificial satellites around the earth (especially 
in the case of a great number of small satellites as e.g., the case 
of Project Westford). One recent example of this type, where a 
distribution function is used in celestial mechanics is given by 
Brouwer [235). 

The most important problem of statistical mechanics ol stellar 
systems at the present time is the justification of its basic as- 
sumptions and the development of new methods of attack on the 
basic statistical problems of stellar dynamics. This can be done 
with some success by numerical integrations of the n-body problem; 
the results reached this way will indicate the statistical methods 
to be developed in stellar dynamics. To this effect stellar dynamics 
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has to cooperate with classical celestial mechanics on one hand 
plasma dynamics on the other. Much work is needed before we 
can construct a consistent and somewhat complete statistical 
theory of stellar systems. 

I want to thank Dr. J. Barkley Rosser for suggesting many 
improvements in the text. 
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Harry Pollard 


j|67 14420 

Qualitative Methods 
in the tt-Body Problem* 


I. Introduction. The n-body problem is concerned with the motion 
of masses m u • • -,m n (n > 1), moving in inertial space under the at- 
traction of their gravitational forces. In the case of a particle m k 
being acted upon by a mass m ; , we illustrate the geometry in Figure 
1. 

With position vectors tj and r^, the differential equation of motion 
due to the force on the kth particle by the other masses is 


( 1 . 1 ) 


m k r k = 


n 


Z 7 

j 


rrijirik 

r% 


r j - x k 

r jk 


(k - 1, 


Assuming that the initial position and velocity are given, i.e., r^(0), 
v^(0) and r jk > 0, we seek a solution of (1.1). 

To realize what constitutes a solution to a differential equation, 
recall the problem 


dy 

dx 


f(*,y ), 


where we seek a solution passing through a predetermined point 


* This is a reporter’s account of Professor Pollard’s lectures, partially revised 
by the editor. 
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Figure 1 . Two Interacting Masses 

(*o> % Vo) • In general y — g{x,c) is said to be a solution depending on 
the parameter c if 

~ g(x,c) = f{x,g(x,c)). 

Obviously to find c we solve y 0 = £(jt 0 ,c). 

More generally, we may say that h(x,y,c) = 0 is the solution, 
meaning that if we determine g(x, c) implicitly by h{x f g(x t c), c) = 0, 
then y = g(x f c) is the solution in the earlier sense. For example con- 
sider 


with solution 


dy _ 2x + ye x> cos e xy 
dx Jte xv sin xv +1 ’ 


x 2 + sine xv -f y = c. 

The latter equation is a solution in the sense that if it is differen- 
tiated the former equation results. Actually such a solution serves 
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little useful purpose unless there exists some transparency that 
makes it more tractable than the example just given. 

Consider a set of differential equations 

j,- g{x ’ y) 

with initial conditions jc( 0) and y(0) given. The problem is to find 
solutions £ = x(0 and y = y(t) satisfying the differential equations 
and the initial conditions. Simple division of these equations elimi- 
nates the variable t and yields dy/dx= h(x,y) where y ~ g(x). Here 
we have managed to reduce the system by one, and there is a chance 
that if the solution is transparent the reduction is useful. In general 
it is not. Thus mathematicians were led to look for integrals to 
systems of differential equations. Returning to equation (1.1), the 
idea is to reduce it to a system of first-order differential equations 
of the form 

dr 

(i2) ax± = fk{Xli ... 9 x m ) (A — 1, •• -,m), 

where x k = x k (t) and jc*( 0) is given for k = 1 , • ■ - ,m. The order of (1.2) 
is m, with m = 6n. A function g(x u * • *, x m , t) is called an integral of 
the system if every solution of the system gives 

g(*i(t) y ---,x m (t),t) = constant, 

where the constant is determined by 

g(x x (t), --- y x m (t) y t) = ^(initial values). 

If we can find m integrals 

(1.3) g k (x x (t), — , jc m (0, 0 = ^(initial values) (k = 1, ***,m) 

they constitute an implicit solution of (1.2) in that they are m equa- 
tions in m unknowns for which we can solve x k = x k (t , initial condi- 
tions), and the problem is solved in terms of t and the initial condi- 
tions. 

II. Illustrative central force problem. To treat the two-body prob- 
lem, n = 2, m = 12, with masses moving in a field subject to the in- 
verse square law, we first look at a central force problem for one 
body, n = 1, m = 6, 
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(III) 

where r = | r| . 


nm r 

mr= -77. 

r r 


or 


r 



Figure 2. A Single Body with a Central Force 
Using Laplace’s method, we write 

d r rv — rr 


( II .2) 


dt r 


r? 


Recalling that r 2 = r 2 and r/* = r • r, (II. 2) can be written as 

, TT dr (rXv)Xr _ (r X v) X r 

dt r r 6 n 

Since r X r = 0 by (II.l), we have r X v = h, which constitutes three 
integrals. Using this in (1 1. 3) gives 

(H.4) i- . 

dt r a 
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( II .5) 


r _ - h X v 
r n 


e, 



v X h 

A 


which constitutes three more integrals. However, the problem is not 
complete since (1.2) has at least one solution of the form 


g*(*i, = constant, 

in which t appears explicitly. But the 6 integrals shown above con- 
tain no such function, implying that e and h are not independent 
of each other. In fact 


e • h = 0. 


Thus, in fact, (II.5) yields but 2 new integrals, with the sixth, 
called the time of perihelion passage, still missing. 

Turning to the larger problem of two bodies, initially at different 
points, we seek 12 integrals for the system (see (1 .1) and Figure 1): 


(II .6) 


™ - _ -)«i| m 2 (r! - r 2 ) 

ni\ rj — ? 

r h r 12 


(II. 7) 


^2 1*2 — 


- 7 m x m 2 (r 2 - r t ) 
^12 r 12 


Adding (II.6) and (II. 7) yields 


(II. 8) 


m 1 fi + m 2 r 2 = 0. 


Define M = m x + m 2 and r c (center of mass) = M~ l (m l r l + m 2 r 2 ), 
so that r c = 0. This last equation indicates the center of mass is not 
accelerating. Integrating to get the velocity and position of the 
center of mass, 


Vc = 1 (3 integrals, conservation of linear momentum) , 

r c = It + j (3 integrals). 

Multiplying (II.6) by X r t and (II. 7) by X r 2 and adding, we get 


"Ii(ri X fj) + m 2 (r 2 X f 2 ) = 0. 


Integration yields 
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(1 1. 9) m 1 (r 1 X VO + m 2 (r 2 X v 2 ) = h. 

These are three more integrals, expressing the constancy of angular 
momentum. Now multiply ( II .6) by • f t , (II. 7) by • f 2 and add. 


m, r, • r'i + m 2 f 2 • f 2 = 


d -y m j m 2 
dt r, 2 


Integrating, we have 

(II. 10) ^ (m, t;, + m 2 vj) = ym ' — + E. 

2 r 12 


This is an integral expressing the conservation of energy. 

Subtracting ( II .6) from ( II .7) to get the equation of motion of the 
second particle with respect to the first, we have 


(II. 11) 


f 2 ~ f i = 


t(™i + m 2 ), 

3 K*2 ~ *l) 

r 6 u 


Let r = it — ti and m = 7 M so that (II. 11) becomes 


( 11 . 12 ) 



This is the central force problem considered above, which yields 
five more integrals. Since we now have more than 12 integrals, some 
must be redundant. They can be reduced to the following: 


(11.13) 


V f 


r c 

m x {r x X Vj) + m 2 ( r 2 X v 2 ) 
+ m 2 vi) 


v 

r 


+ e 


1. 

(3) 

If + j, 

(3) 

h, 

(3) 

vm\ m, £ 

(1) 

ri2 


v X h 

(2) 




Returning to the problem of the time of perihelion passage, opera- 
ting on both sides of the last equation in (11.13) by • r we have 
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(11.14) r = 

/u(l + e COSw) 

which is the polar equation of a conic section with major axis along 
e, and a? the angle between e and r. 

Finally, squaring the last equation in (11.13) we have 


1 + 


2 

r 




) i 2 v 2 h 2 

~r\ +e 2 = — , 


— = — + 
2 r 


(e 2 — l)|x 2 

2h 2 


= ^ + £. 

r 


This is the conservation of energy statement, with hyperbolic mo- 
tion for e 2 > 1, parabolic for e 2 = 1, elliptic for e 2 < 1; for hyper- 
bolic or elliptic motion, we must have h ^ 0. 

Since | a * b| 2 + | a X b| 2 = a 2 6 2 , substituting r and v we have 
r 2 r 2 + h 2 = r 2 v 2 or 

("• 15 ) ^ + ^).i +E . 

But v 2 /2 — iix/r) + E is valid even if h = 0, since if h = 0, then 

r= — n/r 2 . 

Without integrating, (11.15) shows 
(11.16) 

2 r r 

Multiplying (11.16) by r 2 , we have 


(11.17) \h 2 ^nr+ Er\ 

From (11.17) we see that if r — *0, then h = 0. Analogously, in the 
n -body problem, all bodies cannot collide simultaneously unless the 
total angular momentum is 0. 

Multiplying (11.15) by 2r 2 and simplifying, we have 


(11.18) 


(rr) 2 + H 2 = 2 (fir+ Er 2 ). 
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In intervals when r ^ 0, we can introduce a variable u such that 
ii — k/r. Then 

dr dfi i f 

r ~ hd[’ rr ~ kr ’ 


if we let a prime denote differentiation with respect to u. Then (11.18) 
becomes 


(11.19) k 2 (r') 2 + h 2 = 2(nr+ Er 2 ). 

Differentiating (11.19) and dividing by 2 r' yields 

(11.20) k 2 r" = » + 2Er. 

For the case where E = 0 in (11.20), choose k 2 = n, so that r" = 1 and 


( 11 . 21 ) 


r = 


2 


+ au + b. 


Ast = l/k f rdu, we get 

(11.22) t = u" 1/2 (-| + Y+bu + c') 

and we obviously have parabolic motion. 

For E < 0, choose k 2 = 2\E\, yielding r" + r = n/k 2 . Thus 
r = n/k 2 + A cos (u — B). 

By proper choice of the constant of integration in 

u = J * dt 

we can take B = 0. So 

(11.23) r= j^r+Acosu, 

(11.24) t = t 0 + p-u + J- sinw- 

For E > 0, choose k l = 2 £, yielding 

(11.25) r = p- + A cosh u, 

f = (o + p“+ j- sinhu. 


(11.26) 
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Let us now discuss two basic problems of interest in the two-body 
problem: 

(1) For those orbits in which the masses are separating as t — ♦ oo, 
how large is r? 

(2) If h — 0, and for some time t = t ly a collision occurs, how small 
is r? 

For problem (1) we obtain from 

(11.21) and (11.22), r - * 2/3 , for E = 0, 

(11.25) and (11.26), t, for E > 0, 

(11.23) and (11.24), r bounded, for E < 0. 

Now consider the problem when h = 0 and collision occurs at 
t = t x . In short, in what way is r related to (fi — t) as From 

x = — ii/x 2 f multiplying by x and integrating, we have x 2 /2 = n/x 
+ E. Multiplying now by jc and taking the limit as x — * 0 or t — > t u we 
find 

(11.27) limxx 2 = 2/t. 

Assume there exists an a such that x ~ (^ — t) a . Substitution in 
(11.27) yields 

aHt-hnt i-o 2 " -2 — 2 M) 

(<i — t) 3a ~ 2 —>2n/a 2 , 

x ~ (ti — 0 2/3 as t — > <j. 

It is easy to make this result rigorous. 

III. Introduction to the n-body problem. An interesting property 
(see Bertrand, [l]) is that if one has a particle in a circular orbit 
and the initial conditions are changed slightly, only the inverse 
square law (fi/r 2 ) and the linear law (/*r) will yield a new closed orbit. 
In the solar system under a linear law, the planets would move in 
elliptic orbits with the sun at the center, and with a common period. 

Let us first discuss the ra-body problem under an arbitrary law 
/(r). The equations of motion become 

m k f*= X m j m kf{r jk ) Tj - ■ 

*jk 


(III.l) 
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Summing (III.l) over all values of k , 

n 

(111.2) I>*r* = 0 . 

Again using the concept of the mass center, with M = mi + m 2 
+ b m n , 

r c = ^ (m^i + m 2 r 2 H ), 

we get 

f c = 0, 
r c = 1, 

r e = \t + j. 

It simplifies matters if we change to a new coordinate system in 
constant linear motion with respect to the first in which 

(111. 3) f c = f c = r c = 0. 

The equations (III.l) are unchanged, so we have 

n 

(HI. 4) I>*r* = 0, 

k=l 

n 

(HI. 5) I>*v*= 0 . 

k= 1 

The order of the system has now been reduced to 6 n — 6. 

If f(r) = 7 r, (III.l) can be reduced to 

( III .6) r* = — yMr k , k=l 

( 111 . 6 ) implies all the masses satisfy the same differential equa- 
tion, but fails to recognize that perhaps two of the masses may col- 
lide. From (III .6) , the motion is elliptic or linear, and 

(111. 7 ) r* = A* cosa )t + B* sin cut, cu = \J (yM). 

For the situation f(r) = y/r 2 where n = 2 or 3, if the solution to 
(III.l) ceases to be analytic at some time t = t u a collision has oc- 
curred. For n > 3 the problem remains unsolved. 

However, Painleve has shown (see [3]) that if in some finite time 
t = t u a singularity occurs, then 
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minr y * = 0, 

where we have n(n — l)/2 distances r Jk . 

Returning to (III.l), crossing by r* and integrating we get 

n 

(IH.8) Y m *( r * X v*) = h, 

A=1 

subject to (III. 4) and (III.5). The system has now been reduced to 
6 n — 9. 

For the final reduction, define n(r) such that ^'(r) = — /(r). Form 
the self potential 

(III.9) U = Y m j m kti(r jk ). 

1 ^j<k^n 

Relating (III.9) to (III.l), we have 
(HI. 10) m k f k = grad* U . 

Multiplying (III. 10) by r* and integrating, 

(III.11) lYm k vl = E + U. 

* A = 1 

This embodies the conservation of energy. 

To specify the constant, and reduce the system to 6n — 10, for 

fir) = -pi, let fi(r) = J f(r) dr, 

f(r) = y r, let /i(r) = - jf f(r) dr, 

fir) = p, let /i(r) = - f fir) dr. 

IV. The Lagrange- Jacobi identity. Let us define I by 
(IV.l) I=lYm k ri 

* k=l 

Observe that this is half the usual moment of inertia. Since 

n 

21 = Y m * r * • r * 

fe=l 
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(IV.2) 


n 

i = £m*r*-v*. 

*-.i 


Therefore 


(IV. 3) 


r 


n 


Z m *( r * • f* + V* 2 ) 
A=1 


n 


= 2T+ Y, m ***- r*. 

A=1 


But 2 Xi m * r* • f* = ZXi r* • g rad * so we have 

n 

(IV. 4) r=2T+ £r*.grad*!7. 

k=i 


A function f(x u • * •, jc m ) is homogeneous of order k if for 0 < A 


(IV.5) /(AXj, •••,Xx m ) = \ k f(x u ---,x m ). 

Differentiating (IV.5) with respect to X, and letting X become 1 we 
get 

(iv.6) ix,^ = y. 

1 ° X 8 


This is precisely the expression for the coordinates in (IV. 4) if U is 
homogeneous of order Z. i.e., 


(IV - 7 > t.( x ^w: + Y ‘w: + z ‘§r)- 


III 


Thus 


(IV.8) r=2T+lU. 

This is known as the Lagrange-Jacobi identity. Consider the ef- 
fect of letting f(r) = 7 /r p , — 00 < p < 00 . 

In the previous section, p(r) was defined as follows: 

mW = J f(r)dr , 

with the following choice of limits. 
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If P > 1, n(r) = y f % = — — - • yr x p . 

Jr r p p - 1 

j -a 

— t-v*-'’. 

or p — 1 

f r dr 

If P = 1 , M (r) = - 7 — = 7 log (l/ r )- 

Ji r 

Thus, if p < 1 or p > 1, /z(r) and consequently U(r) is homogeneous 
of degree (1 — p). (Recall (III.9).) 

Applying the homogeneity property to (IV.8), 

(IV.9) I = 2T + (1 - p)U 

for f(r) = yr~ p f p ^ 1. 

Since by (III.ll) we have T — U + E and U = T — E, (IV.9) 
becomes 

/= 2U+2E+ (l-p)U 

(IV.10) 

= (3-p)U+2E= (S-p)T+(p-l)E. 

We are now in a position to discuss qualitatively the relationship 
between J, U, T and the general geometry of the problem. 

Let 0 be the mass center of the three-body system shown in 
Figure 3, and define 

m 2 



Figure 3. Three Interacting Masses 



272 


HARRY POLLARD 


R(t) = max r jk (t), 
ftt) = max r*(f), 
r(t) = min r,*(0, 

where these functions by their very definition are not necessarily 
analytic. The use of inequalities will enable us to relate I to these 
new functions. By (IV.l) 

n 

2 / = X m k r 'i . 
k = l 

By definition, each r* g f 2 (/) 

n 

21 g X m h t 2 , 

*=1 

or 

(iv.ii) 

Similarly, = m ^ r l = m f‘ where m = min m k . Combining 

this result with (IV. 11), 

(IV. 12) 

Inequality (IV. 12) tells us / and f 2 are of the same order. 

Now how is I related to /?? An alternate form for /, valid for the 
case where the center of mass is fixed, and useful in this treatment, 
is derived as follows. For a given j 

n n n n 

(iv. 13) X m *( r * - t j ) 2 = Z m * r * + r? X - 2 X m * r * • 

k=\ (fe-1 * = i k=\ 

But with a fixed center of mass, the last term in (IV. 13) vanishes 
by (III.4), and 

n 

(IV. 14) X "i*(r* — r,) 2 = 2/ + Mrf. 

*= 1 

Multiplying (IV. 14) by ntj and summing with respect to j, we finally 
get 

(IV. 15) ~ X m jk r%= I (m jk = m j -m k ). 

1 - 7 * k^n 
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But from the definition of R(t), 


I = R2 ^ ^ mjk ° r simply 

(IV. 16) 

I ^ AR 2 (t) where A is a constant. 

Treating (IV. 15) as we treated (IV. 1) to get (IV. 12), we get con- 
stants A and B such that 


(IV. 17) BR 2 (t) ^1 £AR 2 (t), 

so I is of the order of R 2 (t). A general conclusion is that if for t—^a 
one of the quantities I, R or f — ► oo, they all — » oo. 

Let us now show the relation between r, U and T for 1 < p < 3. 
Since r jk ^ r, 1 /r jk ^ 1/r, and from (III.9), 


B 


y m jk 


(IV. 18) 


A 

r p - ] 


A conclusion from (IV. 18) and the preceding work is that if as t^a 
one of the quantities 1/r, U, T or 7 approaches oo, they all do. 

It is impossible for all the bodies to collide simultaneously after 
an infinite time. To prove this statement, assume the contrary, 
i.e., all r k — 0 for £— > oo, which implies /?(£)—> 0 for t—> oo. 

r^ 0 implies 1/r— > oo, which implies I—* oo by (IV. 18) and (IV. 8). 
If 7 — oo, at some time 7 > 0. For simplicity let 7 > A for some t and 
with A > 0. 7 > A, integrated twice, yields 

A / 2 

(IV. 19) /> + C,t+ C 2 . 

But (IV. 19) tells us that / — oo which implies R — oo from (IV. 16). 
But this is contrary to our hypothesis. 

Consider the problem of fewer than n-bodies colliding after some 
time t= a. Let n = 3, /(r) = y r~ 2 . Then Chazy proved (see [2]) 
that it is impossible for a particular pair of masses to collide as 
t— * «=> if there exists a quantity b >0 such that both remaining dis- 
tances are always greater than or equal to b. That is, r 12 ^0 as 
t~^ ooif there exists <5 such that r 23 and r 13 ^5 > 0. (See Figure 3.) 
We shall prove directly an extension of this result, namely that if 
n = 3, f(r) = yr~ 2 , then r(t) -^0 as ^ 

Proof. Assume r— >0. This implies there exists an r jk — >0 for some 
particular j and k. If no particular r jk becomes and remains the 
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Figure 4. Behavior of r jk 

minimum r jk , this implies at least two r jk are alternately the mini- 
mum. Let them be r 12 and r^. Then when they exchange positions, 
i.e., r 12 < r^— ►T jb < r 12 , there exists a time t m such that r 12 = = r. 

With an infinite number of interchanges, we have r 12 = = r an 

infinite number of times. However, by Figure 3, if r 12 = = r, then 

r 13 ^ r 12 + = 2r. So iZ ^ 2r an infinite number of times. However, 

the reasoning used to derive (IV. 19) tells us that if r — >0 then iZ — > 
Then we cannot have R ^ 2r indefinitely often as r— >0. Thus if 
r(t) — >0, a fixed r jk will eventually become and remain the r(Z) of our 
definition. 

From (IV. 15) and previous results such as (IV. 19), assuming it is 
r 12 that becomes the minimum so r 12 — >0, we have as r— >0(l/2Af) 
(^23^23+ rn l3 rl 3 ) ^ At 2 . Assume ^ m 13 . Then 

(IV. 20) ^23(^23 **13) = At 2 , t\ 3 + Tj 3 > Bt 2 

Now let us show both and r 13 are greater than some multiple of t. 
Suppose t is large. Suppose ^ r 13 . Then by (IV.20) 

^ ( y/ (B/2))t. As — r 13 ^r 12 — >0, we can conclude r 13 ^ \/ (B)t/2. 
If 13 ^ r^, we can make a similar argument. So for large t 

(IV.21) r ^y/(B)t/ 2, r 13 J; V(B)t/2. 

From 

m 3 f 3 = (n - r 3 ) + (r 2 - r 3 ), 

13 r 23 
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Integrate (IV. 22) between t = t x and t = t 2 , t t < t 2 . 

X ** | * X |f,|< " sc (X- 5) • 

or 

(IV. 23) |v 3 0 2 ) - v 3 (<i)| = C (7 7-) • 

\ h Hi 

So v 3 (^) — v 3 (^i) — > 0 as and t 2 — > 00 . However, Cauchy proved that 
if there exists an f(t), t> 0, such that f{t x ) — f(t 2 ) —> 0 as t x and 
k — > °°, then lim exists. Applying this to v 3 (f), lim f _ a> v 3 ( 00) 
exists. 

Now let t 2 > ex?, and ti = t. Then (IV. 23) gives for large t 
(IV. 24) |v 3 (oo) - y 3 «)Uy - 

Integrating (IV. 24) gives 

j x [v 3 (t) - v 3 (oo)]dt | ^ I V 3 (0 - v 3 (oo)| dt ^ Clog*. 

(IV.25) |r 3 (t) - VaMt + CJ ^ Clog*. 

Divide (IV.25) by t and let t— > Since log t/t and c/t—>0, 

(IV. 26) lim-^- = v 3 (oo). 

By (III. 4) ^iirriiTi/t) = 0, and so 

(IV. 27) limM + M = = _ m 3 v 3 («>). 

( — * °° ^ t— ® ^ 

Recalling r 2 — ri— >0 and multiplying by 

(IV. 28) mj y — /7Z 1 ^ >0. 

Combining (IV. 27) and (IV.28) we now conclude that (m 1 + m 2 ) 
(r 2 /t) y {ri/t) and (r 2 /t) all have limits. Since 

I _ Tn x r\ m 2 r\ m 3 r| 

* 2 * 2 ^ t 2 ^ t 2 9 



! 
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we now have lim^ I/t 2 exists and is finite. Dividing by t 2 and 
letting t—>c o, (IV. 19) becomes 



However, in (IV. 19) we could take A greater than any given quan- 
tity. This gives a contradiction. Therefore 0 as t — ><». 

In the n - body problem, a simultaneous collision of all n bodies 
implies that the total angular momentum is zero (h = 0). 

Proof. Such a collision implies R(t) —* 0. We have previously 
shown R 0 after an infinite time, so there exists a time t = t\ < °o 
at which the collision must occur. 

According to (III.8) 

n 

h = X m *( r * X v *)> 

k=l 

so that 

(IV. 29) h ^ m k r k v h . 

k 

The Cauchy inequality states that 

lZaf>| 2 ^(2> 2 )<X& 2 ). 

and since (IV. 29) can be written as 


h ^ X (( V(™*) r k )( V(m k ) v k )), 

k 

we conclude 

(IV.30) (X m * r *) (X m * y *) = 4/T - 

Using T = I — E, from (IV. 10), we change (IV.30) to 
(IV.31) 

But at some time t { , I — >0. Then by (IV. 17) R >0, and so r — 0. 
Therefore, / — • <», which implies that at some time 7 > A > 0. So the 
plot of / against t must be concave upwards. But this means I < 0, 
or — / > 0. Multiplying (IV.31) by — ///, 


h 2 ( - /) 


g - 4/(7- E). 


I 
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Integrating this in the neighborhood of t u 
( I V .32) h 2 log -J. ^ 4EI - 2/ 2 + K £ 4EI + K. 


As 1//— >oo, log (1//) — * oo, and so is eventually >0. Dividing 
(IV.32) by log (1/ /) , 


h 2 ^ 


4£/+K fort 

logy 


^ 1 * 


Now as t — the denomiator —> 00 , /— >0 and the numerator — 
Thus A 2 — >0 as f — or the total angular momentum vanishes if 
there exists a simultaneous collision of all masses. 


V. A Tauberian Theorem and its application. Consider the problem 
of a given function /(x), x>0 such that f(x) ~ Ax 2 , i.e., 
lim*^*, f(x)/x 2 = A , and let us question if it is true that f'(x) ~~ 2 Ax. 

The converse statement (that if f'(x) ^2 Ax, then f(x) ~ Ax 2 ) 
is true, but it is not necessarily true that given an asymptotic rela- 
tion one can differentiate with the result being also an asymptotic 
relation. It is this irreversibility that led to the concept of the 
Tauberian condition, which is that additional information required 
to obtain reversibility in the above limits. 

We first prove that if f'(x)^2Ax, then f(x) ~ Ax 2 . If 
lim x _ od /'(x)/2x = A, then for each c > 0 there exists an x 0 such that 


(V.l) 



< c for x x 0 . 


Multiply (V.l) by 2x, integrate with respect to x, divide by x 2 and 
let x— > 00 . 


(V.2) 



for some constant C. Since e is arbitrary, let e— >0 so (V.2) leads to 


(V.3) 


lim = A or /(x) — Ax 2 . 


Now, to show that if f(x) ^ Ax 2 then f'(x) ^ 2Ax we must introduce 
a Tauberian condition. 
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Landau’s Theorem (1906). If /( x) — Ax 2 , and f"{x) ^ C > — 00 , 
then f'(x) ~ 2 Ax. 

Case 1. A — 0. Consider f(: c + ejc), and its expansion in Taylor 
series with a remainder, up to second-order terms. 

2 2 

(V.4) f(x + tx) = /(x) + txf'(x) + /"(£), 


where £ lies between x and x + ex. Thus 

2 2 

(V.5) f(x + «x) ^ f(x) + txf'(x) + ■ C. 


Dividing (V.5) by x 2 and taking x large enough so that 

I /(X + ex) . 


&--A 

S t 2 and 

x 2 



(x + tx) 2 

and recalling that A = 0, we have 

(V.6) (l +(1+|*| ) 2 “f)- 


= < 2 > 


In (V.6), for t > 0, divide by < 


(V.7) *«(l + (l + |e|) 2 - . 

In (V.6), for t <0, divide by < (reverse inequality) and (V.6) be- 
comes 


(V.8) 


f'(x) 

X 


£<(1 + (1 + m ) 2 - 



Clearly (V.7) and (V.8) imply 


lim 

X— .00 


/'(*) 

2x 


= 0 . 


Case 2. A ^ 0. Define g(x) such that 
(V.9) «« - - A. 

The hypothesis /(x) ~ Ax 2 implies g(x)/x 2 — >0. From (V.9), g"(x) 
= f"(x) - 2 A. 
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If f"(x) ^ C > — co, then g"(x) ^ C - 2A > — °°. Hence the 
argument for the case A = 0 now applies to g{x), so 


(V.10) 

By (V.9), 

(V.ll) 


lim 


g'(x) 

2x 


= 0 . 


g'(*) = f'(x) 
2x 2x 


Using (V.10) and (V.ll), f'(x) ~ 2 Ax. 

The same conclusion can be reached for f" (x) ^ C < co by using 
— / for f in the above argument. 

von Clausius’ Theorem. If we assume the inverse square law, 
and a system is bounded in size and vebcity, then both the kinetic energy 
and potential energy have averages in the limit sense. Specifically 


(V.12) 

tl= lim - f 
t Jo 

Udt = - 2 E, 

(V.13) 

T = lim - 

l , cp t 

f Tdt = —E. 

to 


Proof. If (V.12) is true, (V.13) follows, since T = U + E by 
(III. 11), and since E is constant it is equal to its average value. 
Thus T = U + E. It other words, (V.13) is redundant in view of the 
conservation of energy. 

Now to establish (V.12). Begin with the Lagrange- Jacobi identity 
I = U + 2 E, integrate once and divide by t. 

(V.14) 7 = 7 f Udt+2E+— . 

t t Jo t 

Remembering (IV.2), from the hypothesis of bounded r* and v*, / is 
bounded in time. Thus, as t — * °o, I ft— >0, K/t—>0, and 

U = lim - f Udt=— 2 E. 

Moo ^ JO 


Due to the fact that in some cases, e.g., the parabolic case of the 
two body problem, U = — 2E(= 0) even for an unbounded system 
(r ~ t 2/3 ) , it is possible to prove a stronger theorem. 
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Theorem. A necessary and sufficient condition that U exist and 
equal — 2 E is that lim^*, R(t)/t = 0. 

Here no condition is put upon the velocity, and the system can 
grow with time provided it grows at a slower rate than t. 

To prove the stated theorem, let us first show that //f— ► 0 if and 
only if //£ 2 — ► 0. The argument given before our proof of Landau’s 
Theorem shows that if ift—> 0, then 7/f 2 — > 0. 

We can get the reverse by Landau’s Theorem if we can show 
/ > C > — oo. However, by the Lagrange-Jacobi identity t = U 
+ 2E > 2E > - co. 

By (V.14), U = — 2E if and only if //£— >0. So U — — 2E if and 
only if I / 1 2 — + 0. Then by (IV. 17), U — — 2E if and only if R(t)/t^>0. 
Thus our theorem is proved. 

Theorem. If T exists and equals 0 , then E — 0 . 

Proof. T = U + E, so if T exists, so does 0, and T — U + E. As 
T = 0 by hypothesis, U = — E. But U ^ 0, so — E ^ 0 or 

(V.15) E ^ 0. 

From 7 = T + £, integration once and division by t gives 

<V16) f - 7 X Tdt + E+ T- 

Ast—> if T = 0, the integral in (V.16) must vanish, as does C/t. 
Thus: 


lim — = E so / ~ Et. 
t 

Integration of this asymptotic function gives 

, Et 2 

2 ’ 


or 



E_ 

T' 


But / > 0, so E S 0. Combining this result with (V.15), we have 
E = 0. 

Incidentally, the theorem is true for all p 1. 
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What happens if E < 0? Since T = U + E, and T ^ 0, by hy- 
pothesis 

U+ E^O or - E. 

Thus U ^ \E\ , but A/r ^ U by (IV. 18), so A/r ^ |Z?|. Thus 
A/\E\ ^ r. So if E < 0, then r is bounded but R could conceivably 

* OO. 

If E > 0, there are some interesting conclusions to be drawn. 
Theorem. // E > 0, n = 3, a particle escapes. 

Lemma 1. \EI — I 2 < C as t—> co. 

From the Lagrange- Jacobi equation / = 2E + U, we conclude 
/— > oo as f — > co. Thus, for some time at becomes positive, say 1(a) 
= k > 0. By integrating /, 

(V.17) 7= J I7(r)<7r + 2£(t - a) + k. 

(V.18) 7 = J « - r) I/(t)<7t + £(t - a) 2 + - a) + /. 

Multiplying (V.18) by 4 E, we can write 

(V.19) AEI < 4 E(t -a) £ U{r)dr + A E 2 (t - a) 2 + 4 Ek(t - a) + C. 
Squaring both sides of (V.17) gives 

( V.20) 7 2 > 4E 2 « - a) 2 + 4 E(t - a) j U(r )dr + AEk(t - a) . 

Subtracting (V.20) from (V.19) gives 

AEI - 7 2 < C. 

Define J = 

A familiar argument gives 

(V.21) BR <J< AR. 

Lemma 2. lim,^ l/UJ exists. 

From the definitions, 
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J7J = X! — m « r M> l^j<k^n, 1 g:P<q^n 

r jk 

= . 

r Jk 

Differentiating this with respect to time, 


( v.22) {ujy = ZZ ym Jk mpq ^ /"*> 


But rj k ^ r by definition, so 1 //>* ^ 1/r. From (V.22) and (IV. 18), 
with C = B 2 , 

( V.23) \(UJ)'\ ^CU 2 ZZy V(m jk m„) y/(m jk mj | r jk r„ - r„ r jk \ . 

Square (V.23) use Cauchy’s inequality, and note that in the expan- 
sion of | | 2 we get a middle term 

- 2 X m jk r jk tj > H rn pq r pq = - 2 M 2 ! 2 . 

Thus (V.23) becomes 


| ( UJ ) ' | 2 ^ U 4 (CJ 2 U + CO 
Dividing by U b J b , 


(V.24) 


\jjjjy\l < r 

U b J b = J 3 + J 4 


(J 2 > I 2 ). 


1 

UJ 


But by (IV. 18) and (V.21) 1/1/ ^ Cr and 1/J ^ C/R. Thus 1/ UJ 
^ Cr/ R ^ C. This implies 1 / UJ is bounded, and it will be dropped 
from the right side of (V.24). Similarly, since E > 0, if / > At 2 , then 
J 2 > At 2 , and so J > At. Finally (V.24) can be reduced to 

\(UJ)'\ <_B 

(UJ) b/2 = t 3/2 ' 

Integration of this gives 


1 

1 

g B 

1 

1 

(UJ)f /2 

(UJ) r 

t\ 2 ~ 

*1/2 

^2 


But as<! and t 2 — * <» independently, the right side of (V.25) vanishes, 
and 1/(UJ) 312 — »lim as t — * <» or lim,_„ 1/UJ = l. 

Assume / > 0. Then, there exists a 5 > 0 such that 1/ UJ ^ 5 > 0 
as t — * 


OO, 
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( V.26) 1 7 ^- > 5 > 0, so r S f>R ^ Ct, 

n UJ 

(V. 26 ) shows that as f— > <», r— * oo so all the particles escape. 

Assume / = 0. Then at some time, some />(say r 12 ) becomes the 
minimum, i.e., r 12 = r. If another r jk swaps with r 12 , each time a new 
r jk becomes the minimum, r 12 — r^. (Assuming r 23 is the other mini- 
mum.) Then r 13 — R. But 

ki 2 -r 13 | 

(V.27) or 

|r — R\ ^r. 

Division of (V.27) by R, gives \r/R — 1| g r/R. But lim,^„ r/R = 0, 
which implies | — 1 1 ^ 0. Therefore there exists a min r jk ; call it ri 2 . 
Introduce Jacobi coordinates ( and r, where 

„ /niTi + mj r 2 

< = r 3 — — ’ 

m i + Tti 2 

r = r 2 -r ls 
i + m 2 r 2 + m 3 r 3 =0. 

The above set of equations may be solved to get r x , r 2 and r 3 as 
linear functions of r and (. Such a manipulation would show 

(V.28) / = A{ 2 +Br 2 , 

where A and B are functions only of the masses. 



Figure 5. Jacobi Coordinates 
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Let r 23 =fi, r 12 = r. Then \R-£\^r, or |1 - it/R)\ ^r/R. 
However, by (IV.18) and (V.21), l/U^Cr and l/J^C/R. So 
r/ R ^ C/ UJ. With l = 0, this gives £//?—> 1, £ — R- 

Since R ^ Ct, { ^ Ct so £ becomes unbounded as f — 1 ► 00 . But re- 
writing the form for £ 

Afra-^m.-r,- _ M 

s — r; — r :* — cr ^* 

M ~ m :] M — m A 

Since, as t ~ » oo, £ — oo, we conclude r 3 — * oo. This means m 3 escapes 
from the system. 

VI. Classification of motions. Recall the general equation of motion 

(VI. 1) m k f* = X mj-^- (r, - r*). 

When /(r,*) is a real, analytic function there exists a unique set of 
r k (t) which satisfies (VI.l). For the case where f(r) = y /r 2 , either all 
the r k (t) may be continued analytically as f— » oo, or there is some 
time t = t x at which at least one r k ceases to be analytic. Painleve has 
shown (see [3 j ) that the solution of the rc-body problem permits 
analytic continuation until such time t = t x for which r(f) — • 0 as 
t — * t x , and that this condition is both necessary and sufficient. 

Returning to the work of Chazy in [2] for the case where f— > 
is it possible to find estimates for the growth in r(t) with time? 
From the definition of U, 


yWijk fjk 

r fk 

From (IV.18), with C = B ' 2 , 

(VI.2) \U\ ^ CU 2 Y. ym jk \f jk \ . 

As before, we square both sides of (VI.2), recalling the energy equa- 
tions, to get 

(VI. 3) \U\ 2 ^CU 4 T. 

(Note that in this paper no effort has been made to distinguish 
between the various constants, i.e., A, B, C, C u etc., since they only 
depend on the masses.) As U = T — E ^ - £, we see that if E < 0 
then 
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(VI. 4) 


— ^ \E\, 

r 



If E = 0, T = U and (VI. 3) becomes 


(VI.5) 


\u \ 2 ^ cu b , 


\U\ 

u 52 


^ B. 


Integration gives U 3/2 S B x t + B 2 , so that U 1 ^ Bt 2,i for large t. 
As A/r ^ U, 

(VI.6) r ^ Ct 2ri . 


If E > 0, (VI.3) becomes 
(VI.7) \U\ 2 ^CU 4 (U+ E). 

Let £ = U~ l , so that from (VI.7) we get 

Ur'T ^ cr 4 (r‘ + £) 

C+CEi\ 


(VI.8) 




\ 


£ 




Choose A large enough so that 


(VI.9) 


A > 2 


C + CEA \ 




We wish to show that for sufficiently large t 0 
(VI. 10) £ ^ At 

for all t ^ t 0 . We first suppose that for all t^t u £ ^ A. Then by 
(VI.8) and (VI.9), 

(VI.11) |£| ^ | A 


so that (VI. 10) will eventually hold. Alternatively suppose that ior 
every t x there is a greater t at which £ < A. Take t 0 to be such a t > 1. 
Then we prove (VI. 10) by reductio ad absurdum. Suppose (VI. 10) 
fails. As t Q > 1 and £ < A at t = 0 , we have (VI. 10) holding at t = t 0 , 
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and consequently for t 0 ^ t ^ t 2y where t 2 is the greatest lower bound 
of all V s greater than t 0 for which (VI. 10) fails. Then £ ^ At 2 at 
t = t 2 , and so (since t 2 > t Q > 1), we must have £ ^ A for t 2 — t ^ t 
s; t 2 . Then by (VI. 8) and (VI.9), (VI. 11) must hold for t 2 — c ^ t ^ t 2 . 
With £ ^ At 2 at t = t 2 , this tells us that £ > At at t = t 2 — con- 
tradicting our definition of t 2 as the greatest lower bound of f s 
greater that £ 0 for which (VI. 10) fails. 

From (VI. 10), we get Cr ^ t/' 1 = £ ^ 

Thus 

(VI. 12) r^t. 

For n = 3, and ruling out the case h = 0 of triple collision and 
writing l = lim^ ro (UJ) Chazy gave the following possibilities 
(see [2]): 

E > 0 all r u - 1 hyperbolic case 

two rij s, say r 12 
and r n ^ t 
r is ^ t m 

two r,/s, sayr 12 
and r 23 — i 
^13 g fi 

E = 0 two t\/s, say r 12 

and t 

ris ^ B 

all r i} - * 23 

All the above 
cases. In addition 

E < 0 a case in which 
some Ty oscil- 
lates infinity. 

Chazy was able to exhibit orbits of every type above save for 
the very last, but the latest Russian literature indicates this too has 
now been exhibited. 


hyperbolic- 
elliptic case 


parabolic case 


l = 0 hyperbolic- 
elliptic case 



287 


QUALITATIVE METHODS IN THE rc-BODY PROBLEM 

Let us now return to the Sundman problem, namely the three- 
body problem where h ^ 0, and show that if r— >0 as t— >t h this cor- 
responds to a two particle collision with the third particle moving to 
a definite position with a definite velocity. 

If r(£) — *0 as t-+t u U ° o so /— * °°. But this tells us that the 
curve of I against t must be concave upward in the neighborhood 
of t = t u so I— >L, where 0 ^ L ^ <». We have already ruled out the 
case where I = 0, so we have 0 < L ^ co. 

If r— >0, a pair of particles collide. Then one of the distances r i} 
becomes a minimum and remains so. To prove this, assume r 12 and r < ^ 
are alternately the minimum. Then there exists a sequence of times 
\t n \, where r l2 (t n ) = r 23 (t n ) -+ 0. But r 13 ^ r l2 + r*, so r l3 (t n ) 

— >0 which implies all three r l} — > 0 along this sequence of t n 9 s. 
But note that by (IV. 15) / = 1/2 M^m jk rj k , so /(£„)— >0 along this 
sequence. 

We have already proved I has a limit, so if it approaches 0 
along a particular sequence of times, it will approach 0 no matter 
how you approach t x . But this would be a triple collision, contrary to 
h^0. Thus only one r i; eventually becomes and stays the minimum. 

Now to show that the third particle moves to a definite position 
with a definite velocity. If r 12 — ►(), then — r 13 — *0. We know 

in 12 r\ 2 — * 0. This implies by (IV. 15) that m^r\ 3 - f- m 3x r\ 3 ^2ML > 0. 

As — r 13 — >0, we must have both and r 13 approaching the 
same finite limit, which must be positive, or going to °° together. 
But 


*3 = 


2? <r, ~ r t > + 25! 


r 13 


r 3 * 


(r 2 - r 3 ), 


so 


Iftl ^ 


y m i , ym 2 


r \ 3 


*23 


As and r 13 are bounded away from zero, we see that r 3 remains 
bounded. Recalling that if a function has a bounded derivative as 
the function itself has a limit, we see that r 3 has a limit as 
So f 3 is bounded, and r 3 — ► limit as t— >t lm 
However, m l h + m 2 r 2 + m 3 r 3 = 0, so that m l + m 2 r 2 has a 
limit. Also, r 12 — *0, ri — r 2 — >0, from which we conclude that r! has 
a limit, which is the same limit that r 2 has. That is, r 1} r 2 , and r 3 all 
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have limits, which says that the particles approach definite positions. 
Also, by (IV.l), we conclude that L is finite. 

VII. The Sundman problem. Among the theorems proved by Sund- 
man (see [4]) are the following, where n = 3, we have chosen the 
units so that f(r) = r~ 2 , h ^ 0, and r(t) — *0 as t-^t x < <». 

Theorem 1. lim,_ fl /(f) = L, 0 < L < oo. 

Theorem 2. Two particles collide and the third particle goes to a 
definite position with a finite velocity . 

Theorem 3. If v t and v 2 are the velocities of the colliding particles, 
then 


lim r(t)v‘i = 2 m 2 / ( m i + w 2 ) , 

'-'l 

lim r(t)v 2 = 2m 2 / (mi + m 2 ). 


Theorem 4. The integral 

J U(r)dT, converges (cf.h = 0 in two-body case) . 

Theorem 5. // V is the velocity of m { relative to m 2> then at colli- 
sion 

lim r V 2 = 2 (m x -f w 2 ). 

' *'i 

Theorem 6. lim, tfl r(d 2 /A 2 )(^ 2 ) = 2(m 1 + m 2 ). 

Theorem 7. As f — t u r ^ (f, — t) 2/; \ 

We undertake to prove these. We have already proved Theorems 
1 and 2. As in their proofs, we take r = r 2 — ri. By definition 


IJ- 


| m 23 m 31 

^12 r 23 r 3 i 


Multiply this by r and let f — Ji, to obtain 


(VII. 1) 
From this 


lim rU = m 12 . 


r(T- E) — m 12 . 
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But E is constant, r— » 0 , so we can write rT— >m 12 , or 
r(nt l Ui + m 2 v\ + m 3 i;|) — >2m 12 . 

However, V 3 approaches a limit, so that 
(VII. 2) r(m l Vi) + r(m 2 v 2 ) —*2m l2 . 

The center of mass is such that 
(VII.3) m x y x + m 2 \ 2 + m 3 v 3 = 0 . 

Let us multiply (VII.3) by >Jr, recall that v 3 approaches a limit 
V>— 0 , and combine (VII. 2 ) and (VII. 3 ) to get 

(VI 1. 4) rVi — > - 2w * , 

m 1 + m 2 


(VII. 5) rv 2 — > — — — — — , 

m 1 + m 2 

From (VII.3) we get 

rmi Vi + rm 2 y l * v 2 + rm 3 Vi * v 3 = 0. 

As V(^)vj is bounded by (VII.4) and t > 3 approaches a limit, we 
conclude by (VII.4) 


r\i • v 2 


— 2m i m 2 
m x + m 2 


So by (VII.4) and (VII. 5 ) 

(VII. 6 ) r| y x — v 2 | 2 -^ 2 (mi + m 2 ). 

Theorems 3 and 5 follow from (VII. 4 ), (VII. 5 ), and (VII. 6 ). 
By (IV. 18), Theorem 4 will follow if we can show 


lim 



— — is bounded. 
r(r) 


Since l/r ^ U = t — 2E , I — * 00 as t—+t u so at some point I becomes 
positive. Thus, if we show the existence of a limit for f l /dr, we have 
a limit for /*'dr/r(r). Now, by (IV. 1) 


/ = m^i*! • vO + m 2 (r 2 • v 2 ) + m 3 (r 3 • v 3 ) 


= r l (m l Vj + m 2 v 2 ) + m 2 (r 2 - ri)v 2 + m 3 r 3 * v 3 . 
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So by (III.5) 

i = m 2 r • v 2 + m 3 ( r 3 - r x ) • v 3 . 

We showed in §VI that r x , r 3 , and v 3 have limits. Also 
I r - v 2 ( ^ \/r \A v 2 

so that r • v 2 — *0 by (VII .5) . This proves Theorem 4. 
We have 

(VII.7) J (r 2 ) = 2r-f+2/' 2 . 

By (VI.l) 

r = ^ (r 2 — ri) + bounded term, 

n 2 


r 2 = ^- (ri — r 2 ) + bounded term. 
r 12 

Subtract the first from the second, 


r = — mi m<l r + bounded term. 


So, 


r • r 


Wl i “b Tfl 2 


+ vanishing term. 


Using this with (VII. 6) and (VII.7) gives Theorem 6. 
To find the limit of rf 2 , we write 

rV 2 

rf 2 = F = — 


and use l’Hopital’s rule. 


lim rf 2 — lim 


1 




2r^ 3 + 2r 2 rf = Um 2r(|tf+ff . ) 
f 






= lim r ^ (r 2 ) ” 2(wi + m 2 ) 


by Theorem 6. Thus 

(VII.8) Vrf — dh V(2(mj+ m 2 )). 

But r^O, so (VII.8) becomes 
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and 


J \/ rdr V (2(m 1 + m 2 )) (h -t). 

Integrating the left side of this gives 


^ r 3/2 — yj {2{m x + m 2 )) (t x -t). 
o 


So Theorem 7 holds. 

When two particles approach collision, the forces between them 
approach infinity. When ft ^ 0, so that the third particle remains at 
a finite distance the forces which it exerts on the two colliding 
particles remain finite; hence they become negligible compared to the 
force between the colliding particles. Thus if two particles collide 
(or come extremely close together) the local behavior of the two 
particles should approximate that of a pure two-body problem. 

Theorems 3 through 7 verify particular aspects of this. Sundman 
(see [4]) verified still other aspects, such as that the colliding par- 
ticles approach at a definite angle. However, this is a purely mathe- 
matical result, based on the assumption that the particles are dimen- 
sionless points which collide by coming into exact coincidence. The 
chance that this might occur with real objects of finite dimension is 
vanishingly small. 

If two dimensionless particles have a very near miss, each will 
approach the other, run around it, and recede on what is nearly a 
hyperbola (approximating the exact hyperbolas of the pure two-body 
problem). If the particles have appreciable size, this would be a 
collision; this could vary from nearly head on to grazing impact, 
depending on the closeness of approach in the idealized case of 
dimensionless particles. 
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